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CHLAMYDIA TRACHOMATIS GENOMIC SEQUENCE AND POLYPEPTIDES, 

FRAGMENTS THEREOF AND USES THEREOF, IN PARTICULAR 
FOR THE DIAGNOSIS, PREVENTION AND TREATMENT OF INFECTION 

5 

The subject of the invention is the genomic sequence and the nucleotide sequences 
encoding polypeptides of Chlamydia trachomatis, such as cellular envelope polypeptides, which are 
secreted or specific, or which are involved in metabolism, in the replication process or in virulence, 
polypeptides encoded by such sequences as well as vectors including the said sequences and cells or 

10 animals transformed with these vectors. The invention also relates to transcriptional gene products of 
the Chlamydia trachomatis genome, such as, for example, antisense and ribozyme molecules, which 
can be used to control growth of the microorganism. The invention also relates to methods of 
detecting these nucleic acids or polypeptides and kits for diagnosing Chlamydia trachomatis 
infection. The invention also relates to a method of selecting compounds capable of modulating 

15 bacterial infection and a method for the biosynthesis or biodegradation of molecules of interest using 
the said nucleotide sequences or the said polypeptides. The invention finally comprises, 
pharmaceutical, in particular vaccine, compositions for the prevention and/or treatment of bacterial, in 
particular Chlamydia trachomatis, infections. 

The genus Chlamydia is composed of four species: Chlamydia psittaci. Chlamydia 
20 pecorum, Chlamydia pneumoniae and Chlamydia trachomatis. 

Chlamydia psittaci comprises numerous species, whose hosts are terrestrial vertebrate animals as well 

as birds and occasionally humans; 

Chlamydia pecorum is a pathogen of ruminants; 

Chlamydia pneumoniae is responsible for pneumopathies, for sinusitis and for arterial impairments in 
25 humans; 

Chlamydia trachomatis (Ct) is responsible for a large number of human diseases: 

eye diseases: conventional trachoma, nonendemic trachoma, paratrachoma, inclusion 
conjunctivitis in neonates and in adults; 

genital diseases: nongonococcal uretritis, epididymitis, cervicitis, salpingitis, perihepatitis and 
30 bartholinitis as well as pneumopathy in breast-feeding infants; 

systemic diseases: venereal lymphogranulomatosis (LGV). 
These diseases affect a very large number of women and men [more than 600 million individuals are 
trachoma carriers and there are more than 90 million cases of genital Chlamydia infections] 
worldwide. Accordingly, basic and applied research which makes it possible to understand the 
35 physiopathology linked to this bacterium is very important for public health. (Raulston JE., 1995; 
HackstadtT.etal., 1996). 

Eye impairments due to Chlamydia trachomatis cause trachoma and inclusion 
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conjunctivitis. Trachoma is a chronic conjunctivitis. It is the major cause of curable eye diseases 
leading to blindness. It is estimated that 20 million cases of loss of sight are due to it worldwide. 
Moreover, inclusion conjunctivitis is an eye inflammation which is caused by Chlamydia trachomatis 
and is transmitted by the venereal route. Inclusion conjunctivitis affects adults and neonates exposed 
to genital secretions. 

Two types of eye disease caused by agents of the species Chlamydia trachomatis can 
be distinguished. The conventional trachomatous disease is found in endemic regions; transmission 
occurs from eye to eye and through the hands, or it can be passed on by flies. In nonendemic regions, 
transmission occurs through the genital apparatus; it usually only causes conjunctivitis, most often 
without associated keratitis; it is rare for a pannus or for scars similar to those in trachoma to develop. 
This conjunctival impairment is called paratrachoma to differentiate it from the conventional endemic 
trachoma which is transmitted by the ocular route. The seriousness and the number of cases of 
trachoma have decreased over the last forty years. This is related to the improvement in hygiene and 
living conditions. However, trachoma remains the principal cause of avoidable blindness in Africa, in 
the Middle East and in some regions of Asia. The transmission of the endemic disease occurs in 
particular through close personal contact, in regions where a secondary exposure exists in a repeated 
form. Often, the infection is also latent. In some industrialized countries, such as the United States, a 
mild form of trachoma still exists in some ethnic groups. Sometimes, a tardive trachoma may be found 
following an immunosuppressive treatment. The eye impairments caused by Chlamydia trachomatis, 
such as inclusion conjunctivitis and paratrachoma, are also a complication due to a common venereal 
infection. These infections are not very frequent; they occur most often in young adults. The eye 
impairments in neonates are produced during the passage through the maternal genital routes during 
childbirth. Theoretically, endemic trachoma and inclusion conjunctivitis in adults appear in the form 
of conjunctivitis, the latter being characterized by the presence of lymphoid follicles. In regions where 
the endemic disease is serious, the disease often starts before the age of 2 years and reinfection is 
frequent. Superficial neovascularization is added, in this case, to leukocytic infiltration. The 
conjunctival scars will then cause trichiasis and entropion. The eroded cornea will become a carrier of 
a corneal ulcer of bacterial origin. The scar on the cornea causes blindness. Impairment of the 
lachrymal glands gives a picture of dryness of the cornea. Xerosis becomes complicated with 
secondary bacterial ulcer. In regions where trachoma is endemic, the infectious process disappears 
towards the age of fifteen. The scars then progress to blindness, which affects almost exclusively 
adults. In regions where exposure is lower, the infectious process is, in this case, less rapid and adults 
are carriers of a chronic disease. 

Positive diagnosis of trachoma can be most often established by clinical observation: 
lymphoid follicles are visible on the upper tarsal conjuctiva; conjunctival scar is typical. Vascular 
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pannus exists. In endemic regions, clinical diagnosis is often sufficient. However, isolated cases of 
inclusion conjunctivitis must be the subject of a differential diagnosis, in particular to distinguish 
viral conjunctivitis. 

Public health measures against the endemic form of the disease provide for mass 
5 treatments with tetracycline or erythromycin collyria of all children. The treatment may also provide 
for surgical correction of the lesions. The other conjunctival impairments respond well to general 
treatments with tetracyclines or erythromycin. The prevention of trachomatous disease by health 
measures and by improving living standards is sufficient. Furthermore, to avoid the spread of 
trachoma, antibiotic collyria may be used. 
1° The role of Chlamydia trachomatis in a number of genital impairments has been 

demonstrated over the last three decades. Chlamydia trachomatis is responsible in this case for a 
pathology which may be superposed on the impairments observed with Neisseria gonorrhoeae. The 
pathologies for which Chlamydia trachomatis may be responsible at the genital level are acquired by 
the venereal route and are a major source of sexually transmitted diseases. 

15 The epidemiology of Chlamydia trachomatis genital infections shows each year more 

than 4 million new cases in the United States, and more than 3 million new cases in Europe. Like the 
other venereal infections, Chlamydia trachomatis affects young subjects. There is a direct relationship 
between the number of sexual partners and the frequency of the disease. For example, the frequency 
of Chlamydia trachomatis appears to be five to ten times higher than that of Neisseria gonorrhoeae in 

20 pregnant women. The Chlamydia trachomatis infection is probably more discreet than its Neisseria 
gonorrhoeae homologue. This relative clinical silence, estimated in women at 50% or even 70% of 
infections, explains why the total morbidity of Chlamydia trachomatis conditions is high. Diagnosis 
must therefore be requested in patients who are sometimes asymptomatic carriers of infection. 

Chlamydia trachomatis is responsible for nearly 30% of nongonococcal urethritis, or 

25 NGU. Chlamydia trachomatis urethritis may be discreet, the disease then progresses to a certain form 
of chronicity. The diagnosis will, like for the other clinical forms of the disease, be called into play 
later. 

Chlamydia trachomatis is a cause of epididymitis in humans during a period of sexual 
activity. The bacterium may be found in the urethra, urine, sperm or even a sample collected by 
30 aspiration from the epididymis. It is in particular found in humans under 35 years of age. A discharge 
from the urethra which is associated with the disease suggests the diagnosis of a Chlamydia condition 
or sometimes a gonococcal condition. 

Untreated Reiter's syndrome, if accompanied by urethritis, evokes a Chlamydia 
trachomatis condition. 

35 Chlamydia trachomatis affects 30% to 40% of women who are clinically carriers of a 
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gonorrhoea (or have had contact), 10% to 20% of women having a venereal origin, 5% of women 
consulting having no particular origin. 

The cervix is often normal during a Chlamydia trachomatis infection. However, a 
hypertrophic cervical erythema will cause such an infection to be suspected. Chlamydia trachomatis 
5 is responsible for an endocervicitis whereas viral impairments result in exocervicitis. A 
nongonococcal endocervicitis requires treating the patient and partners with tetracyclines. 

Chlamydia trachomatis is responsible for a large number of acute salpingites. The 
picture is often complicated by an acute peritonitis or even a perihepatitis. 

In case of pregnancy, the risk is first that of infection of the neonate at birth. 
1 0 However, the risk of postpartum complications exists (endometritis or salpingitis). 

The reference method for the diagnosis of Chlamydia trachomatis is the isolation of 
the bacterium on cell culture. For all infections, the sample collection should make it possible to 
obtain a suitable sample with the aid of a swab. This sample should be transported to a laboratory 
under excellent conditions; in particular, the cold chain must absolutely be maintained. The placing in 
15 cell culture on mouse fibroblasts will be carried out by people having specific skills. The distinction 
of Chlamydia trachomatis with labelled antibodies and the observation of cell cultures under a 
microscope will take place two days after placing in culture. Provided these imperatives are observed, 
cell culture is a reliable technique. However, the constraints linked to this technique are many: not 
only must the laboratory be equipped for the cell culture, but, furthermore, highly competent staff 
20 must take care of this type of diagnosis. 

Techniques for identifying genetic material can obviously be used for the detection of 
Chlamydia trachomatis. Among these techniques, enzymatic gene amplification or PCR is favoured 
by those skilled in the art. The technique indeed makes it possible to identify Chlamydia trachomatis 
with a very high sensitivity and complete specificity. Initially used in specialist laboratories, PCR is 
25 now performed in numerous medical laboratories. This diagnostic approach is important because it 
allows detection of the bacteria even in samples which have been transported under poor conditions. 

The treatment of Chlamydia urethritis with antibiotics such as tetracycline or 
quinolones is very effective. The duration of treatment varies between 7 and 14 days. The treatment of 
pregnant women poses the problem of contraindications to tetracycline. 
30 Neonatal infections caused by Chlamydia trachomatis are explained by the frequency 

of these bacteria in the cervix. In some studies, 5% to 13% of impairments are observed in the cervix 
in asymptomatic pregnant women. The neonates risk, in this case, developing an inclusion 
conjunctivitis. Not only can Chlamydia trachomatis be isolated from the children's eyes, but also 
persistently from the rhinopharynx and also from the rectum. Pneumopathies and otitis media are also 
35 found, a result of contamination at childbirth. 




i 
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Differential diagnosis of inclusion conjunctivitis in neonates is required with 
gonococcal ophthalmia; while the duration of incubation is from one to three days in the case of a 
gonococcal ophthalmia, neonatal inclusion conjunctivitis has an acute beginning with discharge and 
formation of membranes or even of conjunctival scars. 
5 Treatment consists of oral erythromycin at the dose of 40 to 50 mg per kg of weight, 

for two to three weeks. In a nonendemic trachoma region, this disease never progresses to chronicity. 

Finally, mention should be made of infantile pneumopathy. The syndrome is well 
defined; it is found in children affected by Chlamydia trachomatis. Less than ten children are affected 
by Chlamydia trachomatis pneumopathies per thousand births. The syndrome is, in this case, always 
10 found at an early age (less than four months). 

Venereal lymphogranulomatosis is an infection which is transmitted through sexual 
contact and is due to Chlamydia trachomatis strains LI, L2 and L3. In humans, a passing primary 
genital lesion is followed by an often suppurative and multiple regional lymphadenopathy. This 
disease is a general disease which is accompanied by fever and a rise in the number of white blood 
15 cells. If it progresses to chronicity, the disease then becomes complicated with genital elephantiasis, 
stricture or even fistula of the genital apparatus, of the penis, of the urethra and of the rectum. 

The three Chlamydia trachomatis strains LI, L2 and L3 are responsible for venereal 
lymphogranulomatosis. These Chlamydia strains are more virulent than the strains responsible for 
trachoma and STD. It is very important to note that venereal lymphogranulomatosis is a systemic 
20 disease which affects primarily the lymphatic tissue. Generally transmitted by the sexual route, 
Chlamydia trachomatis L may also cause contamination through direct contact or even during poor 
laboratory handling. In spite of these variable modes of transmission, the age for the highest incidence 
of these diseases corresponds to that for greater sexual activity. Venereal lymphogranulomatosis is 
still endemic in South America, in Africa and sometimes in Asia. For a long time, the prevalence of 
25 venereal lymphogranulomatosis was difficult to establish because of the difficulty of performing 
diagnosis with certitude. It should also be noted that men are affected more often than women. In low 
endemic regions, it is difficult to recognize the reservoir of microbes. This situation is explained by 
the fact that the isolation of the strains causing venereal lymphogranulomatoses from asymptomatic 
subjects is rarely successful. 
30 Clinical impairment by venereal lymphogranulomatosis manifests itself by the 

appearance of a small ulcer 3 to 21 days after the exposure of small nonpainful vesicles. In both men 
and women, the lesion is most often silent. Since this impairment disappears within a few days and 
causes no functional discomfort and leaves no visible scar, the disease is often recognized late. The 
venereal lymphogranulomatosis strains may be found in the urethra or the endocervix in patients with 
35 inguinal adenopathies; these regions are then considered as the initial site of infection. The 



WO 99/28475 PCT/IB98/01939 

6 

characteristic feature of the venereal lymphogranulomatosis strains is that from the initial site of 
infection, Chlamydia exhibits a diffusion drained by the lymphatic ducts. The disease is then 
complicated by a ganglionic impairment of the region draining the site of inoculation". By way of 
example, anorectal infection causes deep adenopathies. These adneopathies are marked by the 
5 appearance of a periadenitis which forms a fluctuating and suppurative ganglionic mass. Fistulae will 
appear during the decline of the disease. As general signs are present at this stage of the disease, it is 
often confused with a malignant lymphoma. The other general complications are rarely observed. 
Clinical examinations have been able to lead biologists to isolate Chlamydia from the cerebrospinal 
fluid or from the blood. It should also be noted that in a number of cases (5%), venereal 
10 lymphogranulomatosis is complicated by a chronic oedema; this is genital elephantiasis. 

The diagnosis of venereal lymphogranulomatosis requires the isolation of the 
Chlamydia strains involved in the disease. However, isolation on cell cultures is rarely used, but 
immunological reactions may be used. 

The treatment of venereal lymphogranulomatosis in its initial phase is identical to the 
15 treatment of other Chlamydia infections. In the chronic phases, antibiotics have little effect on the 
progress of the disease, but they are however useful in case of superinfection. Although the 
recommended therapeutic arsenal is identical, it is advisable to prolong the treatment for a period of at 
least four weeks. In addition to this treatment, reconstructive surgery may be useful in cases of 
urethral, penile or rectal strictures, as well as for the treatment of fistulae. 
20 In conclusion, a short and effective treatment, without recurrences, and a well- 

tolerated treatment of Chlamydia trachomatis infections therefore remains desirable. 

An even greater need up until now relates to a diagnosis which is specific to each of 
the strains, which is sensitive, which can be carried out conveniently and rapidly, and which allows 
early detection of the infection. 
25 No vaccine is currently available against Chlamydia trachomatis. The role of the 

immune defense in the physiology and pathology of the disease should probably be understood in 
order to develop satisfactory vaccines. 

More detailed information relating to the biology of these strains, their interactions 
with their hosts, the associated phenomena of infectivity and those of escaping the immune defenses 
30 of the host in particular, and finally their involvement in the development of the these associated 
• pathologies, will allow a better understanding of these mechanisms. In the light of the preceding text 
which shows in particular the limitations of the means of controlling Chlamydia trachomatis 
infection, it is therefore at present essential, on the one hand, to develop molecular tools, in particular 
from a better genetic knowledge of Chlamydia trachomatis, but also to develop new preventive and 
35 therapeutic treatments, new diagnostic methods and new vaccin strategies which are specific, 
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effective and tolerated. This is precisely the object of the present invention. 

The subject of the present invention is the nucleotide sequence having the sequence 
SEQ ID No. 1 of the Chlamydia trachomatis LGV2 genome. However, the invention is not limited to 
SEQ ID No. 1 , but encompasses genomes and nucleotides encoding polypeptides of strain variants, 
5 polymorphisms, allelic variants, and mutants. 

Thus, the subject of the present invention encompasses nucleotide sequences 
characterized in that they arc chosen from: 

a) the nucleotide sequence of SEQ ID No. 1, a nucleotide sequence exhibiting at least 
99.9% identity with the sequence SEQ ID No. 1, the nucleotide sequence of the genomic DNA 

10 contained within ECACC Deposit No. 98112618, the nucleotide sequence of a clone insert within 
ECACC Deposit No. 981 12617 (these being provisional deposit numbers); 

b) a nucleotide sequence homologous to the sequence SEQ ID No. 1 ; 

c) a polynucleotide sequence that hybridizes to the nucleotide sequence of a) under 
conditions of high or intermediate stringency as described below; 

15 (0 B y wav of example and not limitation, procedures using conditions of high stringency are 

as follows; Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65°C in 
buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.02% BSA, and 500 jig/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65°C, 
the preferred hybridization temperature, in prehybridization mixture containing 100 jig/ml denatured 

20 salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeIed probe. Alternatively, the hybridization step 
can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 0.15M NaCl and 
0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a solution containing 
2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X SSC at 50°C for 45 
min. Alternatively, filter washes can be performed in a solution containing 2 x SSC and 0.1% SDS, 

25 or 0.5 x SSC and 0. 1 % SDS, or 0. 1 x SSC and 0. 1 % SDS at 68°C for 15 minute intervals. Following 
the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high 
stringency which may be used are well known in the art and as cited in Sambrook et al„ 1989, 
Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47- 
9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing Associates 

30 and Wiley Interscience, N. Y, are incorporated herein in their entirety. 

(ii) By way of example and not limitation, procedures using conditions of intermediate 
stringency are as follows: Filters containing DNA are prehybridized, and then hybridized at a 
temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and th hybridized probes are 

35 detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
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well known in the art and as cited in Sambrook et ah, 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al. t 1989, 
Current Protocols in Molecular Biology, Green Publishing Associates and Wiley IntersCience, N.Y. 
are incorporated herein in their entirety. 
5 d) a nucleotide sequence complementary to the sequence SEQ ID No. 1 or 

complementary to a nucleotide sequence as defined in a), b) or c), and a nucleotide sequence of their 
corresponding RNA; 

e) a nucleotide sequence of a representative fragment of the sequence SEQ ID No. 1, or 
of a representative fragment of the nucleotide sequence as defined in a), b), c) or d); 
10 f) a nucleotide sequence comprising a sequence as defined in a), b), c), d) or e); 

g) a nucleotide sequence capable of being obtained from a nucleotide sequence as 
defined in a), b), c), d), e) or f); and 

h) a modified nucleotide sequence of a nucleotide sequence as defined in a), b), c), d), 
e),f)org). 

15 Sequence of the genome, or genomic sequence of Chlamydia trachomatis is 

understood to mean the sequence of the chromosome of Chlamydia trachomatis, in contrast with the 
plasmid sequence of Chlamydia trachomatis. 

Nucleotide sequence, polynucleotide or nucleic acid are understood to mean, 
according to the present invention, either a double-stranded DNA, a single-stranded DNA or products 

20 of transcription of the said DNAs. 

It should be understood that the present invention does not relate to the genomic 
nucleotide sequences of Chlamydia trachomatis taken in their natural environment, that is to say in 
the natural state. They are sequences which may have been isolated, purified or partially purified, by 
separation methods such as, for example, ion-exchange chromatography, molecular size exclusion 

25 chromatography or affinity chromatography, or alternatively fractionation techniques based on 
solubility in various solvents, or by genetic engineering methods such as amplification, cloning or 
subcloning, it being possible for the sequences of the invention to be carried by vectors. 

The nucleotide sequence SEQ ID No. 1 was obtained by sequencing the Chlamydia 
trachomatis LGV2 genome by the method of directed sequencing after fluorescent automated 

30 sequencing of the inserts of clones and assembling of these sequences of nucleotide fragments 
(inserts) by means of softwares (cf. Examples). In spite of the high precision of the sequence 
SEQ ID No. 1, it is possible that it does not perfectly, 100% represent the nucleotide sequence of the 
Chlamydia trachomatis LGV2 genome and that a few rare sequencing errors or uncertainties still 
remain in the sequence SEQ ID No. 1. In the present invention, the presence of an uncertainty for an 

35 amino acid is designated by «Xaa» and that for a nucleotide is designated by «N» in the sequence 
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listing below. These few rare errors or uncertainties could be easily detected and corrected by persons 
skilled in the art using the entire chromosome and/or its representative fragments according to the 
invention and standard amplification, cloning and sequencing methods, it being possible for the 
sequences obtained to be easily compared, in particular by means of a computer software and using 
5 computer-readable media for recording the sequences according to the invention as described, for 
example, below. After correcting these possible rare eiTors or uncertainties, the corrected nucleotide 
sequence obtained would still exhibit at least 99.9% identity with the sequence SEQ ID No. 1. Such 
rare sequencing uncertainties are not present within the DNA contained within ECACC Deposit No. 
98112617 or 98112618 (provisional numbers) and whatever rare sequence uncertainties that exist 
1 0 within SEQ ID No. 1 can routinely be corrected utilizing the DNA of the ECACC Deposits. 

Homologous nucleotide sequence for the purposes of the present invention is 
understood to mean a nucleotide sequence having a percentage identity with the bases of the 
nucleotide sequence SEQ ID No. 1 of at least 80%, preferably 90% and 95%, this percentage being 
purely statistical and it being possible for the differences between the two nucleotide sequences to be 

15 distributed randomly and over their entire length. The said homologous sequences exhibiting a 
percentage identity with the bases of the nucleotide sequence SEQ ID No. 1 of at least 80%, 
preferably 90% and 95%, may comprise, for example, the sequences corresponding to the genomic 
sequence or to the sequences of its representative fragments of a bacterium belonging to the 
Chlamydia family, including the species Chlamydia pneumoniae, Chlamydia psittaci and Chlamydia 

20 pecorum mentioned above, as well as the sequences corresponding to the genomic sequence or to the 
sequences of its representative fragments of a bacterium belonging to the variants of the species 
Chlamydia trachomatis. In the present invention, the terms family and genus are mutually 
interchangeable, the terms variant, serotype, strain and subspecies are also mutually interchangeable. 
These homologous sequences may thus correspond to variations linked to mutations within the same 

25 species or between species and may correspond in particular to truncations, substitutions, deletions 
and/or additions of at least one nucleotide. The said homologous sequences may also correspond to 
variations linked to the degeneracy of the genetic code or to a bias in the genetic code which is 
specific to the family, to the species or to the variant and which are likely to be present in Chlamydia. 

Protein and/or nucleic acid sequence homologies may be evaluated using any of the 

30 variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 
CLUSTALW (Pearson and Lipman, 1988, Proc. Natl Acad. ScL USA S5(<?;:2444-2448; Altschul et 
al, 1990, 7. Mol Biol 275^:403-410; Thompson et al, 1994, Nucleic Acids Res. 22(2,1:4673-4680; 
Higgins et al, 1996, Methods Enzymoi 266:383-402; Altschul et al, 1990, J. Mol Biol 215(3)AQ3- 

35 410; Altschul et al, 1993, Nature Genetics 3:266-272). 
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In a particularly preferred embodiment* protein and nucleic acid sequence homologies 
are evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the 
art (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. ScL USA 57:2267-2268; Altschul et al, 
1990, J. Mol Biol. 2/5:403-410; Altschul et al. t 1993, Nature Genetics 3:266-272; Altschul et al, 
1997, Nuc. Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to 
perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein sequence 
database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against the 
six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, which are 
referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid sequence 
and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. 
High-scoring segment pairs are preferably identified {i.e., aligned) by means of a scoring matrix, 
many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix 
(Gonnetera/.. 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 77:49-61). Less 
preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds., 
1978, Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, 
Washington: National Biomedical Research Foundation) 

The BLAST programs evaluate the statistical significance of all high-scoring segment 
pairs identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance of a 
high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g., 
Karlin and Altschul, 1990, Proc. Natl. Acad. ScL USA 57:2267-2268). 

Nucleotide sequence complementary to a sequence of the invention is understood to 
mean any DNA whose nucleotides are complementary to those of the sequence of the invention, and 
whose orientation is reversed (antiparallel sequence). 

The present invention further comprises fragments of the sequences of a) through h) 
above. Representative fragments of the sequences according to the invention will be understood to 
mean any nucleotide fragment having at least 8 successive nucleotides, preferably at least 12 
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successive nucleotides, and still more preferably at least 15 or at least 20 successive nucleotides of the 
sequence from which it is derived. It is understood that such fragments refer only to portions of SEQ 
ID No. 1 that are not currently listed in a publicly available database. 

Among these representative fragments, those capable of hybridizing under stringent 
5 conditions with a nucleotide sequence according to the invention are preferred. Hybridization under 
stringent conditions means that the temperature and ionic strength conditions are chosen such that 
they allow hybridization to be maintained between two complementary DNA fragments. 

By way of illustration, high stringency conditions for the hybridization step for the 
purposes of defining the nucleotide fragments described above, are advantageously the following. 
10 The hybridization is carried out at a preferred temperature of 65°C in the presence of 

SSC buffer, 1 x SSC corresponding to 0.15 M NaCl and 0.05 M Na citrate. The washing steps may be, 
for example, the following: 

2xSSC, 0.1% SDS at room temperature followed by three washes with 1 x SSC, 0.1% SDS; 
0.5 x SSC, 0.1% SDS; 0.1 x SSC, 0.1% SDS at 68°C for 15 minutes. 

15 Intermediate stringency conditions, using, for example, a temperature of 60°C in the 

presence of a 5 x SSC buffer, or of low stringency, for example a temperature of 50°C in the presence 
of a 5 x SSC buffer, respectively require a lower overall complementarity for the hybridization 
between the two sequences. 

The stringent hybridization conditions described above for a polynucleotide of about 

20 300 bases in size will be adapted by persons skilled in the art for larger- or smaller-sized 
oligonucleotides, according to the teaching of Sambrook et al, 1989. 

Among the representative fragments according to the invention, those which can be 
used as primer or probe in methods which make it possible to obtain homologous sequences or their 
representative fragments according to the invention, or to reconstitute a genomic fragment found to be 

25 incomplete in the sequence SEQ ID No. 1 or carrying an error or an uncertainty, are also preferred, 
these methods, such as the polymerase chain reaction (PCR), cloning and sequencing of nucleic acid 
being well known to persons skilled in the art. These homologous nucleotide sequences corresponding 
to mutations or to inter- or intra-species variations, as well as the complete genomic sequence or one 
of its representative fragments capable of being reconstituted, of course form part of the invention, 

30 Among the said representative fragments, those which can be used as primer or probe 

in methods allowing diagnosis of the presence of Chlamydia trachomatis or one of its associated 
microorganisms as defined below are also preferred. 

The representative fragments capable of modulating, regulating, inhibiting or 
inducing the expression of a gene of Chlamydia trachomatis or one of its associated microorganisms, 

35 and/or capable of modulating the replication cycle of Chlamydia trachomatis or one of its associated 
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microorganisms in the host cell and/or organism, are also preferred. Replication cycle is intended to 
designate invasion, multiplication, intracellular localization, in particular retention in the vacuole and 
inhibition of the process of fusion to the lysosome, and propagation of Chlamydia trachomatis or one 
of its associated microorganisms from host cells to host cells. 
5 Among the said representative fragments, those corresponding to nucleotide 

sequences corresponding to open reading frames, called ORF sequences (ORF for open reading 
frame), and encoding polypeptides, such as for example, but without being limited thereto, the ORF 
sequences which will be later described, are finally preferred. 

The representative fragments according to the invention may be obtained, for 
10 example, by specific amplification, such as PCR, or after digestion, with appropriate restriction 
enzymes, of nucleotide sequences according to the invention; these methods are in particular 
described in the manual by Sambrook et al. f 1989. The said representative fragments may also be 
obtained by chemical synthesis when they are not too large in size and according to methods well 
known to persons skilled in the art. For example, such fragments can be obtained by isolating 
15 fragments of the genomic DNA of ECACC Deposit No. 981 12618 or a clone insert present at this 
ECACC Deposit No. 98112617 (provisional numbers). 

The representative fragments according to the invention may be used, for example, as 
primer, to reconstitute some of the said representative fragments, in particular those in which a 
portion of the sequence is likely to be missing or imperfect, by methods well known to persons skilled 
20 in the art such as amplification, cloning or sequencing techniques. 

Modified nucleotide sequence will be understood to mean any nucleotide sequence 
obtained by mutagenesis according to techniques well known to persons skilled in the art, and 
exhibiting modifications in relation to the normal sequences, for example mutations in the regulatory 
and/or promoter sequences for the expression of a polypeptide, in particular leading to a modification 
25 of the level of expression of the said polypeptide or to a modulation of the replicative cycle. 

Modified nucleotide sequence will also be understood to mean any nucleotide 
sequence encoding a modified polypeptide as defined below. 

The subject of the present invention also includes Chlamydia trachomatis nucleotide 
sequences characterized in that they are chosen from a nucleotide sequence of an open reading frame 
30 (ORF), that is, the ORF2 to ORF1 197 sequences. 

The ORF2 to ORF1197 nucleotide sequences are defined in Tables 1 and 2, infra, 
represented below by their position on the sequence SEQ ID No. 1. For example, the ORF10 
sequence is defined by the nucleotide sequence between the nucleotides at position 9828 and 10430 
on the sequence SEQ ID No. 1, ends included. ORF2 to ORF1197 have been identified via 
35 homology analyses as well as via analyses of potential ORF start sites, as discussed in the examples 
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below. It is to be understood that each identified ORF of the invention comprises a nucleotide 
sequence that spans the contiguous nucleotide sequence from the codon immediately 3' to the stop 
codon of the preceding ORF and through the 5' codon to the next stop codon of SEQ ID No,:l in- 
frame to the ORF nucleotide sequence. Table 2, infra, lists the beginning, end and potential start site 
5 of each of ORFs 2-1 197. In one embodiment, the ORF comprises the contiguous nucleotide sequence 
spanning from the potential ORF start site downstream (that is, 3') to the ORF stop codon (or the ORF 
codon immediately adjacent to and upstream of the ORF stop codon). ORF2 to ORF1 197 encode the 
polypeptides of SEQ ID No. 2 to SEQ ID No. 1 197. 

Upon introduction of minor frameshifts, certain individual ORFs can comprise larger 
10 «combined» ORFs. A list of such putative «combined» ORFs is shown in Table 3, below. For 
example, a combined ORF can comprise ORF 1076 and ORF 1073, including intervening in-frame, 
nucleotide sequences. The order of ORFs (5" to 30, within each «combined» ORF is as listed. It is to 
be understood that when ORF2 to ORF1197 are referred to herein, such reference is also meant to 
include «combined» ORFs. Polypeptide sequences encoded by such «combined» ORFs are also part 
15 of the present invention. 
Table 3 

ORF 1076, ORF 1073; 

ORF 3, ORF 2; 

ORF 23, ORF 22, ORF 21; 
20 ORF 1141, ORF 477, ORF 478, ORF 479; 

ORF 487, ORF 486, ORF 485, ORF 484, ORF 483, ORF 482, ORF 481; 

ORF 488, ORF 489; 

ORF 573, ORF 572, ORF 571; 

ORF 817, ORF 818; 
25 ORF 8 19, ORF 820; 

ORF 1037, ORF 1038; 

ORF 1071, ORF 1070; 

ORF 17. ORF 1077; 

ORF 1 185, ORF 933, ORF 934; 
30 ORF 1060, ORF 1059; 

ORF 155, ORF 156; 

ORF 679, ORF 680; 

ORF 879, ORF 878; 

ORF 1028; ORF 1029, 
35 and representative fragments. 
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Table 1 also depicts the results of homology searches that compared the sequences of 
the polypeptides encoded by each of the ORFs to sequences present in public published databases. It 
is understood that in one embodiment, those polypeptides listed in Table 1 as exhibiting greater than 
about 95% identity to a polypeptide present in a publicly disclosed database are not considered part of 
5 the present invention; likewise in this embodiment, those nucleotide sequences encoding such 
polypeptides are not considered part of the invention. In another embodiment, it is understood that 
those polypeptides listed in Table 1 as exhibiting greater than about 99% identity to a polypeptide 
present in a publicly disclosed database are not considered part of the invention; likewise, in this 
embodiment, those nucleotide sequences encoding such polypeptides are not considered part of the 
10 invention. 

The invention also relates to the nucleotide sequences characterized in that they 
comprise a nucleotide sequence chosen from: 

a) an ORF2 to ORF1197 , a «combined» ORF nucleotide sequence, the nucleotide 
sequence of the genomic DNA contained within ECACC Deposit No. 98112618 or the nucleotide 

15 sequence of a clone insert in ECACC Deposit No. 981 12617 according to the invention; 

b) a homologous nucleotide sequence exhibiting at least 80% identity across an entire 
ORF2 to ORF1 197 nucleotide sequence according to the invention or as defined in a); 

c) a polynucleotide sequence that hybridizes to ORF2 to ORF1 197 under conditions of 
high or intermediate stringency as described below: 

20 (0 B y wa y of example and not limitation, procedures using conditions of high 

stringency are as follows: Prehybridization of filters containing DNA is carried out for 8 h to 
overnight at 65°C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 ug/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 100 

25 ng/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Alternatively, the 
hybridization step can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 
0.15M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a 
solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X 
SSC at 50°C for 45 min. Alternatively, filter washes can be performed in a solution containing 2 x 

30 SSC and 0.1% SDS, or 0.5 x SSC and 0.1% SDS, or 0.1 x SSC and 0.1% SDS at 68°C for 15 minute 
intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other 
conditions of high stringency which may be used are well known in the art and as cited in Sambrook 
et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press. 
N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green 

35 Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety. 
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Preferably, such sequences encode a homolog of a polypeptide encoded by one of ORF2 to ORF1 197. 
In one embodiment, such sequences encode a Chlamydia trachomatis polypeptide. 

(ii) By way of example and not limitation, procedures using conditions of 
intermediate stringency are as follows: Filters containing DNA are prehybridized, and then hybridized 
5 at a temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and the hybridized probes are 
detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et aL, 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, 
10 Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. 
are incorporated herein in their entirety. Preferably, such sequences encode a homolog of a 
polypeptide encoded by one of ORF2 to ORF1197 . In one embodiment, such sequences encode a 
Chlamydia trachomatis polypeptide. 

d) a complementary or RNA nucleotide sequence corresponding to an ORF2 to 
15 ORF1 197 sequence according to the invention or as defined in a), b) or c); 

e) a nucleotide sequence of a representative fragment of an ORF2 to ORF1197 
sequence according to the invention or of a sequence as defined in a), b), c) or d); 

f) a nucleotide sequence capable of being obtained from an ORF2 to ORF1197 
sequence according to the invention or as defined in a), b), c), d) or e); and 

20 g) a modified nucleotide sequence of an ORF2 to ORF1 197 sequence according to the 

invention or as defined in a), b), c), d), e) or f). 

As regards the homology with the ORF2 to ORF1197 nucleotide sequences, the 
homologous sequences exhibiting a percentage identity with the bases of one of the ORF2 to 
ORF1197 nucleotide sequences of at least 80%, preferably 90% and 95%, are preferred. Such 

25 homologous sequences are identified routinely via, for example, the algorithms described above and 
in the examples below. The said homologous sequences correspond to the homologous sequences as 
defined above and may comprise, for example, the sequences corresponding to the ORF sequences of 
a bacterium belonging to the Chlamydia family, including the species Chlamydia pneumoniae, 
Chlamydia psittaci and Chlamydia pecorum mentioned above, as well as the sequences corresponding 

30 to the ORF sequences of a bacterium belonging to the variants of the species Chlamydia trachomatis. 
These homologous sequences may likewise correspond to variations linked to mutations within the 
same species or between species and may correspond in particular to truncations, substitutions, 
deletions and/or additions of at least one nucleotide. The said homologous sequences may also 
correspond to variations linked to the degeneracy of the genetic code or to a bias in the genetic code 

35 which is specific to the family, to the species or to the variant and which are likely to be present in 
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Chlamydia. 

The invention comprises the polypeptides encoded by a nucleotide sequence 
according to the invention, preferably by a representative fragment of the sequence SEQ ID No. 1 and 
corresponding to an ORF sequence, in particular the Chlamydia trachomatis polypeptides, 
5 characterized in that they are chosen from the sequences SEQ ID No. 2 to SEQ ID No. 1 197 , 

and representative fragments thereof. However, the invention is not limited to polypeptides encoded 
by ORFs in SEQ ID No. 1 and its corresponding ORF sequences, but encompasses polypeptides of 
strain variants, polymorphisms, allelic variants, and mutants. 

Thus, the invention also comprises the polypeptides characterized in that they 
10 comprise a polypeptide chosen from: 

a) a polypeptide encoded by a polynucleotide sequence in SEQ ID No. 1 (e.g., any 
polypeptide encoded by a polynucleotide sequence corresponding to ORF2 to ORF1197) and/or 
representative fragments thereof according to the invention; 

b) a polypeptide homologous to a polypeptide according to the invention, or as defined 

15 in a); 

c) a polypeptide encoded by a polynucleotide sequence that hybridizes to SEQ ID No. 1 
or ORF2 to ORF1 197 under high or intermediate stringency as described below: 

(i) By way of example and not limitation, procedures using conditions of high 
stringency are as follows: Prehybridization of filters containing DNA is carried out for 8 h to 

20 overnight at 65°C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 ^tg/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 100 
Hg/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-Iabeled probe. Alternatively, the 
hybridization step can be performed at 65°C in the presence of SSC buffer, 1 x SSC corresponding to 

25 0.1 5M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37°C for 1 h in a 
solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1X 
SSC at 50°C for 45 min. Alternatively, filter washes can be performed in a solution containing 2 x 
SSC and 0.1% SDS, or 0.5 x SSC and 0.1 % SDS, or 0.1 x SSC and 0.1% SDS at 68°C for 15 minute 
intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other 

30 conditions of high stringency which may be used are well known in the art and as cited in Sambrook 
et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, 
N.Y., pp. 9.47-9.57; and Ausubel et ah, 1989, Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety. 
Preferably, such sequences encode a homolog of a polypeptide encoded by one of ORF2 to ORF1 197 

35 , In one embodiment, such sequences encode a Chlamydia trachomatis polypeptide. 
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(ii) By way of example and not limitation, procedures using conditions of 
intermediate stringency are as follows: Filters containing DNA are prehybridized, and then hybridized 
at a temperature of 60°C in the presence of a 5 x SSC buffer and labeled probe. Subsequently, filters 
washes are performed in a solution containing 2x SSC at 50°C and the hybridized probes are 
5 detectable by autoradiography. Other conditions of intermediate stringency which may be used are 
well known in the art and as cited in Sambrook et al„ 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, 
Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. 
are incorporated herein in their entirety. Preferably, such sequences encode a homolog of a 
10 polypeptide encoded by one of ORF2 to ORF1197 . In one embodiment, such sequences encode a 
Chlamydia trachomatis polypeptide. 

d) a fragment of at least 5 amino acids of a polypeptide according to the invention, or as 
defined in a), b) or c); 

e) a biologically active fragment of a polypeptide according to the invention, or as 
15 defined in a), b), c) or d); and 

f) a modified polypeptide of a polypeptide according to the invention, as defined in a), 
b), c), d) or e). 

In the present description, the terms polypeptide, peptide and protein are 

interchangeable. 

20 R should be understood that the invention does not relate to the polypeptides in 

natural form, that is to say that they are not taken in their natural environment but that they may have 
been isolated or obtained by purification from natural sources, or alternatively obtained by genetic 
recombination, or else by chemical synthesis and that they may, in this case, comprise nonnatural 
amino acids, as will be described below. 

25 Homologous polypeptide will be understood to designate the polypeptides exhibiting, 

in relation to the natural polypeptide, certain modifications such as in particular a deletion, addition or 
substitution of at least one amino acid, a truncation, an extension, a chimeric fusion, and/or a 
mutation, or polypeptides exhibiting post-translational modifications. Among the homologous 
polypeptides, those whose amino acid sequence exhibits at least 80%, preferably 90%, homology or 

30 identity with the amino acid sequences of the polypeptides according to the invention are preferred. In 
the case of a substitution, one or more consecutive or nonconsecutive amino acids are replaced by 
«equivalent» amino acids. The expression «equivalent» amino acid is intended here to designate any 
amino acid capable of being substituted for one of the amino acids in the basic structure without, 
however, essentially modifying the biological activities of the corresponding peptides and as will be 

35 defined later. 
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Protein and/or nucleic acid sequence homologies may be evaluated using any of the 
variety of sequence comparison algorithms and programs known in the art. Such algorithms and 
programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and 
CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA S5fSJ:2444-2448; Altschul et 
5 al y 1990, / Mol Biol 2/5(5;:403-410; Thompson et a/., 1994, Nucleic Acids Res. 22(2J:4673-4680; 
Higgins et al, 1996, Methods Enzymol. 266:383-402; Altschul et al f 1990, J. Mol. Biol. 215(3)A03- 
410; Altschul et al t 1993, Nature Genetics 5:266-272). 

In a particularly preferred embodiment, protein and nucleic acid sequence homologies 
are evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well know in the art 
10 (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 57:2267-2268; Altschul et al, 1990, 
J. Mol. Biol. 275:403-410; Altschul et al. y 1993, Nature Genetics 5:266-272; Altschul et al y 1997, 
Nuc. Acids Res. 25:3389-3402). In particular, five specific BLAST programs are used to perform the 
following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein sequence 
database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against the 
six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, which are 
referred to herein as "high-scoring segment pairs," between a query amino or nucleic acid sequence 
and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. 
High-scoring segment pairs are preferably identified (i.e. t aligned) by means of a scoring matrix, 
many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix 
(Gonnet et a/., 1992, Science 256:1443-1445; Henikoff and Henikoff, 1993, Proteins 77:49-61). Less 
preferably, the PAM or PAM250 matrices may also be used (see, e.g., Schwartz and Dayhoff, eds., 
1978, Matrices for Detecting Distance Relationships; Atlas of Protein Sequence and Structure, 
Washington: National Biomedical Research Foundation) 

The BLAST programs evaluate the statistical significance of all high-scoring segment 
pairs identified, and preferably selects those segments which satisfy a user-specified threshold of 
significance, such as a user-specified percent homology. Preferably, the statistical significance of a 
high-scoring segment pair is evaluated using the statistical significance formula of Karlin (see, e.g., 
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Karlin and Altschul, 1990, Proc. Natl Acad. ScL USA 57:2267-2268). 

Equivalent amino acids may be determined either based on their structural homology 
with the amino acids for which they are substituted, or on results of comparative tests of biological 
activity between the various polypeptides which may be carried out. 
5 By way of example, there may be mentioned the possibilities of substitutions which 

may be carried out without resulting in a substantial modification of the biological activity of the 
corresponding modified polypeptides; the replacements, for example, of leucine with valine or 
isoleucine, of aspartic acid with glutamic acid, of glutamine with asparagine, of arginine with lysine, 
and the like, the reverse substitutions naturally being feasible under the same conditions. 
10 The homologous polypeptides also correspond to the polypeptides encoded by the 

homologous nucleotide sequences as defined above and thus comprise in the present definition the 
mutated polypeptides or polypeptides corresponding to inter- or intra-species variations which may 
exist in Chlamydia, and which correspond in particular to truncations, substitutions, deletions and/or 
additions of at least one amino acid residue. 
15 Biologically active fragment of a polypeptide according to the invention will be 

understood to designate in particular a polypeptide fragment, as defined below, exhibiting at least one 
of the characteristics of the polypeptides according to the invention, in particular in that it is: 

capable of eliciting an immune response directed against Chlamydia trachomatis; and/or 
capable of being recognized by an antibody specific for a polypeptide according to the 
20 invention; and/or 

capable of binding to a polypeptide or to a nucleotide sequence of Chlamydia trachomatis; 

and/or 

capable of modulating, regulating, inducing or inhibiting the expression of a gene of 
Chlamydia trachomatis or one of its associated microorganisms, and/or capable of modulating the 

25 replication cycle of Chlamydia trachomatis or one of its associated microorganisms in the host cell 
and/or organism; and/or 

capable of generally exerting an even partial physiological activity, such as for example a 
structural activity (cellular envelope, ribosome), an enzymatic (metabolic) activity, a transport 
activity, an activity in the secretion or in the virulence. 

30 A representative polypeptide fragment according to the invention is understood to 

designate a polypeptide comprising a minimum of 5 amino acids, preferably 10 amino acids or 
preferably 15 amino acids. It is to be understood that such fragments refer only to portions of 
polypeptides encoded by ORF2 or ORF1197 that arc not currently listed in a publicly available 
database. 

35 The polypeptide fragments according to the invention may correspond to isolated or 
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purified fragments which are naturally present in Chlamydia trachomatis or which are secreted by 
Chlamydia trachomatis, or may correspond to fragments capable of being obtained by cleaving the 
said polypeptide with a proteolytic enzyme, such as trypsin or chymotrypsin or collagenase, or with a 
chemical reagent, such as cyanogen bromide (CNBr) or alternatively by placing the said polypeptide 
5 in a highly acidic environment, for example at pH 2.5. Such polypeptide fragments may be equally 
well prepared by chemical synthesis, using hosts transformed with an expression vector according to 
the invention containing a nucleic acid allowing the expression of the said fragments, placed under 
the control of appropriate elements for regulation and/or expression. 

«Modified polypeptide* of a polypeptide according to the invention is understood to 

10 designate a polypeptide obtained by genetic recombination or by chemical synthesis as will be 
described below, exhibiting at least one modification in relation to the normal sequence. These 
modifications may in particular affect amino acids responsible for a specificity or for the efficiency of 
the activity, or responsible for the structural conformation, for the charge or for the hydrophobicity, 
and for the capacity for multimerization and for membrane insertion of the polypeptide according to 

15 the invention. It is thus possible to create polypeptides with an equivalent, an increased or a reduced 
activity, and with an equivalent, a narrower or a broader specificity. Among the modified 
polypeptides, there may be mentioned the polypeptides in which up to 5 amino acids may be 
modified, truncated at the N- or C-terminal end, or alternatively deleted, or else added. 

As is indicated, the modifications of the polypeptide may have in particular the 

20 objective: 

of making it capable of modulating, regulating, inhibiting or inducing the expression of a gene 
of Chlamydia, in particular of Chlamydia trachomatis and its variants, or one of its associated micro- 
organisms, and/or capable of modulating the replication cycle of Chlamydia, in particular of 
Chlamydia trachomatis and its variants, or one of its associated microorganisms, in the host cell 
25 and/or organism, 

of allowing its use in methods of biosynthesis or of biodegradation, or its incorporation into 
vaccine compositions, 

of modifying its bioavailability as a compound for therapeutic use. 

The said modified polypeptides may also be used on any cell or microorganism for 
30 which the said modified polypeptides will be capable of modulating, regulating, inhibiting or inducing 
gene expression, or of modulating the growth or the replication cycle of the said cell or of the said 
microorganism. The methods allowing demonstration of the said modulations on eukaryotic or 
prokaryotic cells are well known to persons skilled in the art. The said cells or microorganisms will be 
chosen, in particular, from tumour cells or infectious microorganisms and the said modified 
35 polypeptides may be used for the prevention or treatment of pathologies linked to the presence of the 
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said cells or of the said microorganisms. It is also clearly understood that the nucleotide sequences 
encoding the said modified polypeptides may be used for the said modulations, for example by the 
intermediacy of vectors according to the invention and which are described below, so as to prevent or 
to treat the said pathologies. 
5 The above modified polypeptides may be obtained using combinatory chemistry, in 

which it is possible to systematically vary portions of the polypeptide before testing them on models, 
cell cultures or microorganisms for example, so as to select the compounds which are the most active 
or which exhibit the desired properties. 

Chemical synthesis also has the advantage of being able to use: 
10 - nonnatural amino acids, or 

- nonpeptide bonds. 

Accordingly, in order to extend the life of the polypeptides according to the 
invention, it may be advantageous to use nonnatural amino acids, for example in the D form, or 
alternatively amino acid analogues, in particular sulphur-containing forms for example. 

*5 Finally, the structure of the polypeptides according to the invention, its homologous 

or modified forms, as well as the corresponding fragments may be integrated into chemical structures 
of the polypeptide type and the like. Accordingly, it may be advantageous to provide at the N- and 
C-terminal ends compounds which are not recognized by proteases. 

Also forming part of the invention are the nucleotide sequences encoding a 

20 polypeptide according to the invention. Described below are ORF nucleotide sequences encoding 
polypeptides exhibiting particularly preferable characteristics. For each group of preferred ORFs 
described below, it is to be understood that in addition to the individual ORFs listed, in instances 
wherein such ORFs are present as part of «combined» ORFs, the «combined» ORFs are also to be 
included within the preferred group. 

25 More particularly, the subject of the invention is nucleotide sequences, characterized 

in that they encode a polypeptide of the cellular envelope, preferably of the outer cellular envelope of 
Chlamydia trachomatis or one of its representative fragments, such as for example the predominant 
proteins of the outer membrane, the adhesion proteins or the proteins entering into the composition of 
the Chlamydia wall. Among these sequences, the sequences comprising a nucleotide sequence chosen 

30 from the following sequences are most preferred: 

ORF3; ORF19; ORF51; ORE189; ORF212; ORF213; ORF324; ORF477; ORF478; ORF479; 
ORF481; ORF482; ORF483; ORF484; ORF486; ORF488; ORF489; ORF490; ORF572; ORF573; 
ORF742; ORF817; ORF818; ORF820; ORF1035; ORF1036; ORF1037; ORF1038; ORF1070; 
ORF1071; ORF1073 and one of their representative fragments. 

35 The structure of the cytoplasmic membranes and of the wall of bacteria is dependent 
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on the associated proteins. The structure of the cytoplasmic membrane makes it impermeable to 
water, to water-soluble substances and to small-sized molecules (ions, small inorganic molecules, 
peptides or proteins). To enter into or to interfere with a cell or a bacterium, a ligand must establish a 
special relationship with a protein anchored in the cytoplasmic membrane (the receptor). These 
5 proteins which are anchored on the membrane play an important role in metabolism since they control 
the exchanges in the bacterium. These exchanges apply to molecules of interest for the bacterium 
(small molecules such as sugars and small peptides) as well as undesirable molecules for the 
bacterium such as antibiotics or heavy metals. 

The double lipid layer structure of the membrane requires the proteins which are 
10 inserted therein to have hydrophobic domains of about twenty amino acids forming an alpha helix. 
Predominantly hydrophobic and potentially transmembrane regions may be predicted from the 
primary sequence of the proteins, itself deduced from the nucleotide sequence. The presence of one or 
more putative transmembrane domains raises the possibility for a protein to be associated with the 
cytoplasmic membrane and to be able to play an important metabolic role therein or alternatively for 
1 5 the protein thus exposed to be able to exhibit potentially protective epitopes. 

If the proteins inserted into the membrane exhibit several transmembrane domains 
capable of interacting with one another via electrostatic bonds, it then becomes possible for these 
proteins to form pores which go across the membrane which becomes permeable for a number of 
substances. It should be noted that proteins which do not have transmembrane domains may also be 
20 anchored by the intermediacy of fatty acids in the cytoplasmic membrane, it being possible for the 
breaking of the bond between the protein and its anchor in some cases to be responsible for the 
release of the peptide outside the bacterium. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 
25 one of its representative fragments, having between 1 and 3 transmembrane domains and in that they 
comprise a nucleotide sequence chosen from the following sequences: 

ORF2; ORF3; ORF5; ORF8; ORF9; ORF10; ORF11; ORF12; ORF17; ORF21; ORF26; ORF27; 

ORF28; ORF29; ORF30; ORF31; ORF33; ORF35; ORF37; ORF39; ORF40; ORF41; ORF42; 

ORF43; ORF44; ORF45; ORF46; ORF47; ORF48; ORF49; ORF52; ORF53; ORF55; ORF56; 
30 ORF58; ORF65; ORF66; ORF68; ORF70; ORF74; ORF75; ORF76; ORF78; ORF79; ORF81; 

ORF82; ORF83; ORF86; ORF91; ORF92; ORF94; ORF97; ORF100; ORF102; ORF103; ORF105; 

ORF106; ORF107; ORF109; ORF110; ORF111; ORF112; ORF113; ORF114; ORF115; ORF116; 

ORF117; ORF120; ORF122; ORF123; ORF130; ORF134; ORF135; ORF137; ORF140; ORF141; 

ORF143; ORF144; ORF145; ORF147; ORF148; ORF149; ORF150; ORF151; ORF155; ORF156; 
35 ORF162; ORF163; ORF164; ORF165; ORF166; ORF167; ORF168; ORF169; ORF170; ORF171; 
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ORF173; ORF175; ORF176; ORF177; 0RF181; ORF183; ORF184; ORF186; ORF187; ORF188 
ORF190; 0RF191; ORF192; ORF194; ORF195; ORF196; ORF197; ORF198; ORF199; ORF201; 
ORF202; ORF204; ORF206; ORF207; ORF209; ORF212; ORF213; ORF217; ORF219; ORF220; 
ORF221; ORF222; ORF223; ORF224; ORF225; ORF227; ORF228; ORF231; ORF232; ORF234; 
5 ORF236; ORF237; ORF243; ORF244; ORF245; ORF247; ORF248; ORF249; ORF252; ORF254; 
ORF257; ORF260; ORF261; ORF263; ORF265; ORF266; ORF267; ORF270; ORF271; ORF272; 
ORF274; ORF276; ORF277; ORF278; ORF279; ORF282; ORF283; ORF284; ORF285; ORF287; 
ORF289; ORF290; ORF291; ORF294; ORF298; ORF305; ORF306; ORF310; 0RF311; ORF313; 
ORF315; ORF316; ORF319; ORF320; ORF322; ORF323; ORF325; ORF326; ORF327; ORF328; 
10 ORF330; ORF331; ORF332; ORF333; ORF334; ORF335; ORF336; ORF338; ORF339; ORF340; 
ORF341; ORF344; ORF345; ORF348; ORF349; ORF350; ORF351; ORF352; ORF353; ORF356; 
ORF357; ORF358; ORF361; ORF362; ORF366; ORF367; ORF368; ORF370; ORF372; ORF373; 
ORF375; ORF377; ORF378; ORF379; ORF380; ORF382; ORF383; ORF384; ORF385; ORF387; 
ORF389; ORF390; ORF391; ORF393; ORF396; ORF398; ORF399; ORF403; ORF404; ORF406; 
15 ORF407; ORF413; ORF414; ORF417; ORF418; ORF420; ORF421; ORF424; ORF426; ORF427; 
ORF428; ORF430; ORF433; ORF434; ORF435; ORF436; ORF437; ORF440; ORF443; ORF446; 
ORF448; ORF450; ORF451; ORF454; ORF455; ORF457; ORF458; ORF459; ORF463; ORF464; 
ORF466; ORF467; ORF468; ORF469; ORF470; ORF473; ORF474; ORF475; ORF476; ORF477; 
ORF479; ORF480; ORF481; ORF483; ORF484; ORF485; ORF486; ORF487; ORF488; ORF491; 
20 ORF493; ORF496; ORF497; ORF498; ORF500; ORF501; ORF503; ORF504; ORF508; ORF512; 
ORF513; ORF514; ORF519; ORF521; ORF523; ORF524; ORF526; ORF527; ORF529; ORF530; 
ORF531; ORF532; ORF534; ORF536; ORF537; ORF538; ORF540; ORF541; ORF542; ORF543; 
ORF544; ORF545; ORF546; ORF547; ORF551; ORF552; ORF553; ORF555; ORF558; ORF559; 
ORF560; ORF561; ORF562; ORF566; ORF567; ORF568; ORF569; ORF571; ORF572; ORF574; 
25 ORF575; ORF576; ORF580; ORF582; ORF585; ORF587; ORF589; ORF592; ORF593; ORF595; 
ORF596; ORF597; ORF599; ORF601; ORF602; ORF603; ORF604; ORF608; ORF609; ORF610; 
ORF611; ORF615; ORF616; ORF617; ORF618; ORF621; ORF622; ORF623; ORF624; ORF625; 
ORF628; ORF632; ORF633; ORF634; ORF635; ORF637; ORF638; ORF640; ORF641; ORF643; 
ORF646; ORF648; ORF649; ORF651; ORF652; ORF653; ORF654; ORF655; ORF658; ORF664; 
30 ORF665; ORF666; ORF668; ORF669; ORF670; ORF671; ORF672; ORF673; ORF674; ORF676; 
ORF677; ORF678; ORF680; ORF682; ORF683; ORF684; ORF686; ORF688; ORF689; ORF690; 
ORF691; ORF692; ORF693; ORF695; ORF696; ORF698; ORF701; ORF703; ORF704; ORF705; 
ORF706; ORF707; ORF709; ORF710; ORF711; ORF712; ORF713; ORF714; ORF715; ORF717; 
ORF718; ORF720; ORF721; ORF722; ORF724; ORF726; ORF728; ORF729; ORF730; ORF731; 
35 ORF732; ORF733; ORF734; ORF737; ORF738; ORF739; ORF740; ORF742; ORF743; ORF744; 
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ORF745; ORF746; ORF748; ORF750; ORF751; ORF752; ORF753; ORF754; ORF755; ORF757 
ORF758; ORF759; ORF760; ORF764; ORF766; ORF768; ORF769; ORF771; ORF772; ORF773; 
ORF774; ORF775; ORF776; ORF777; ORF778; ORF779; ORF780; ORF781; ORF782; ORF783; 
ORF786; ORF787; ORF788; ORF789; ORF790; ORF793; ORF798; ORF800; ORF802; ORF803; 
5 ORF806; ORF808; ORF809; ORF810; 0RF811; ORF813; ORF814; ORF817; ORF820; ORF822; 
ORF824; ORF825; ORF827; ORF828; ORF829; ORF830; ORF833; ORF834; ORF835; ORF837; 
ORF838; ORF839; ORF840; ORF841; ORF842; ORF843; ORF845; ORF848; ORF849; ORF850; 
ORF851; ORF852; ORF854; ORF855; ORF856; ORF857; ORF859; ORF860; ORF862; ORF863; 
ORF864; ORF866; ORF869; ORF872; ORF873; ORF874; ORF878; ORF879; ORF880; ORF881; 
10 ORF883; ORF884; ORF885; ORF886; ORF887; ORF892; ORF893; ORF894; ORF895; ORF897; 
ORF899; ORF900; ORF901; ORF904; ORF906; ORF909; ORF910; ORF912; ORF914; ORF917; 
ORF920; ORF921; ORF922; ORF923; ORF924; ORF925; ORF926; ORF927; ORF930; ORF933; 
ORF934; ORF935; ORF936; ORF937; ORF940; ORF941; ORF942; ORF943; ORF944; ORF945; 
ORF947; ORF948; ORF951; ORF952; ORF953; ORF954; ORF955; ORF956; ORF957; ORF958; 
15 ORF960; ORF961; ORF962; ORF963; ORF964; ORF966; ORF967; ORF969; ORF970; ORF971; 
ORF973; ORF974; ORF979; ORF980; ORF981; ORF982; ORF984; ORF988; ORF989; ORF990; 
ORF991; ORF995; ORF996; ORF999; ORF1001; ORF1003; ORF1004; ORF1005; ORF1006; 
ORF1007; ORF1009; ORF1010; ORF1011; ORF1012; ORF1013; ORF1014; ORF1016; ORF1017; 
ORF1018; ORF1020; ORF1021; ORF1025; ORF1026; ORF1027; ORF1029; ORF1030; ORF1031; 
20 ORF1035; ORF1036; ORF1037; ORF1038; ORF1039; ORF1040; ORF1044; ORF1045; ORF1047; 
ORF1048; ORF1050; ORF1051; ORF1052; ORF1053; ORF1055; ORF1056; ORF1057; ORF1058; 
ORF1061; ORF1062; ORF1063; ORF1064; ORF1065; ORF1066; ORF1068; ORF1069; ORF1072; 
ORF1074; ORF1076 and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
25 invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 
one of its representative fragments, having between 4 and 6 transmembrane domains and in that they 
comprise a nucleotide sequence chosen from the following sequences: 

ORF7; ORF14; ORF16; ORF32; ORF34; ORF36; ORF38; ORF50; ORF57; ORF59; ORF61; ORF62; 

ORF63; ORF64; ORF67; ORF69; ORF72; ORF77; ORF80; ORF84; ORF87; ORF93; ORF95; 
30 ORF99; ORF108; ORF119; ORF125; ORF126; ORF129; ORF131; ORF136; ORF139; ORF146; 

ORF152; ORF154; ORF160; ORF161; ORF172; ORF179; ORF182; ORF185; ORF200; ORF203; 

ORF205; ORF239; ORF242; ORF250; ORF253; ORF256; ORF259; ORF262; ORF268; ORF275; 

ORF281; ORF286; ORF288; ORF292; ORF295; ORF296; ORF297; ORF299; ORF300; ORF308; 

ORF314; ORF317; ORF318; ORF324; ORF342; ORF343; ORF355; ORF360; ORF374; ORF376; 
35 ORF386; ORF388; ORF392; ORF394; ORF395; ORF402; ORF405; ORF411; ORF415; ORF416; 
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ORF422; ORF423; ORF429; ORF432; ORF441; ORF442; ORF444; ORF449; ORF452; ORF456; 

ORF460; ORF461; ORF465; ORF471; ORF472; ORF482; ORF489; ORF492; ORF494; ORF495; 

ORF502; ORF505; ORF506; ORF509; ORF516; ORF517; ORF520; ORF525; ORF533; ORF539; 

ORF549; ORF554; ORF557; ORF563; ORF570; ORF573; ORF581; ORF590; ORF591; ORF600; 
5 ORF607; ORF612; ORF613; ORF620; ORF626; ORF629; ORF630; ORF639; ORF644; ORF647; 

ORF656; ORF659; ORF661; ORF685; ORF687; ORF699; ORF700; ORF708; ORF716; ORF719; 

ORF725; ORF747; ORF749; ORF756; ORF765; ORF767; ORF794; ORF796; ORF797; ORF799; 

ORF801; ORF807; ORF821; ORF823; ORF826; ORF847; ORF853; ORF861; ORF870; ORF871; 

ORF875; ORF882; ORF888; ORF889; ORF898; ORF902; ORF903; 0RF911; ORF916; ORF931; 
10 ORF939; ORF975; ORF976; ORF978; ORF983; ORF986; ORF987; ORF992; ORF993; ORF1000; 

ORF1002; ORF1008; ORF1019; ORF1022; ORF1032; ORF1034; ORF1046; ORF1054; ORF1060; 

ORF1071 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis transmembrane polypeptide or 
15 one of its representative fragments, having at least 7 transmembrane domains and in that they 

comprise a nucleotide sequence chosen from the following sequences: 

ORF4; ORF6; ORF13; ORF20; ORF51; ORF71; ORF88; ORF118; ORF128; ORF132; ORF133; 

ORF158; ORF159; ORF174; ORF180; ORF189; ORF210; ORF211; ORF214; ORF215; ORF226; 

ORF229; ORF233; ORF235; ORF240; ORF246; ORF251; ORF255; ORF273; ORF354; ORF364; 
20 ORF369; ORF371; ORF397; ORF401; ORF409; ORF412; ORF419; ORF439; ORF453; ORF462; 

ORF490; ORF510; ORF511; ORF518; ORF535; ORF548; ORF550; ORF564; ORF565; ORF578; 

ORF579; ORF614; ORF631; ORF636; ORF650; ORF662; ORF667; ORF679; ORF681; ORF702; 

ORF727; ORF741; ORF763; ORF791; ORF792; ORF815; ORF816; ORF832; ORF846; ORF858; 

ORF865; ORF867; ORF868; ORF877; ORF891; ORF896; ORF907; ORF908; ORF918; ORF919; 
25 ORF932; ORF959; ORF977; ORF994; ORF998; ORF1024; ORF1028; ORF1042; ORF1067; 

ORF1070; ORF1073 and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis surface exposed polypeptide 
an outer membrane protein) or one of its representative fragments, said nucleotide sequences 
30 comprising a nucleotide sequence chosen from the following sequences: 

ORF 2, ORF 3, ORF 21, ORF 22, ORF 23, ORF 53, ORF 77, ORF 187, ORF 203, ORF 383, ORF 

477, ORF 478, ORF 479, ORF 481, ORF 482, ORF 483, ORF 484, ORF 485, ORF 486, ORF 487, 

ORF 488, ORF 489, ORF 490, ORF 571, ORF 572, ORF 573, ORF 593, ORF 670, ORF 693, ORF 

742, ORF 749, ORF 801, ORF 817, ORF 818, ORF 819, ORF 820, ORF 851, ORF 902, ORF 923, 
35 ORF 1035. ORF 1036, ORF 1037, ORF 1038, ORF 1069, ORF 1070. ORF 1071, ORF 1073, ORF 
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1076, ORF 1095, ORF 1096, ORF 1141, ORF 1181, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis lipoprotein or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
5 the following sequences: 

ORF 29, ORF 42, ORF 66, ORF 72, ORF 76, ORF 78, ORF 148, ORF 154, ORF 180, ORF 182. ORF 
184, ORF 187, ORF 200, ORF 242, ORF 245, ORF 250, ORF 253, ORF 272, ORF 274, ORF 275, 
ORF 308, ORF 350, ORF 362, ORF 383, ORF 394, ORF 396, ORF 399, 

ORF 422, ORF 488, ORF 535, ORF 568, ORF 573, ORF 578, ORF 593, ORF 607, ORF 625, ORF 
10 662, ORF 669, ORF 688, ORF 690, ORF 716, ORF 773, ORF 778, ORF 78 1, ORF 783, ORF 788, 
ORF 817, ORF 848, ORF 851, ORF 853, ORF 857, ORF 875, ORF 877, ORF 886, ORF 898, ORF 
902, ORF 923, ORF 938, ORF 976, ORF 978, ORF 990, ORF 1005, ORF 1021, ORF 1035, ORF 
1069, ORF 1083, ORF 1088, ORF 1089, ORF 1091, ORF 1092, ORF 1095, ORF 1096, ORF 1100, 
ORF 1105. ORF 1108, ORF 1117, ORF 1120, ORF 1121, ORF 1124, ORF 1128. ORF 1133, ORF 
15 1 135. ORF 1 139, ORF 1 140, ORF 1 157, ORF 1 159, ORF 1 163, ORF 1 165, ORF 1 167, ORF 1 168, 
ORF 1169, ORF 1171, ORF 1173, ORF 1174, ORF 1177, ORF 1180, ORF 1181, ORF 1186, ORF 
1 194, ORF 1 197, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide involved in 
20 lipopolysaccharide (LPS) biosynthesis, said nucleotide sequences comprising a nucleotide sequence 
chosen from the following sequences: ORF 17, ORF 201, ORF 691, ORF 807, ORF 936, ORF 983. 
ORF 1019, ORF 1077 and one of their representative fragments. 

Preferably the invention relates to additional LPS-related nucleotide sequences 
according to the invention, characterized in that they encode: 
25 ( a > a Chlamydia trachomatis KDO (3-deoxy-D-manno-octulosonic acid)-related 

polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 
sequence chosen from the following sequences: ORF 41, ORF 242, ORF 269, ORF 772, and one of 
their representative fragments; 

(b) a Chlamydia trachomatis phosphomannomutase-related polypeptide or one of 
30 its representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 

the following sequence: ORF 139, and one of its representative fragments; 

(c) a Chlamydia trachomatis phosphoglucomutase-related polypeptide or one of its 
representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
the following sequence: ORF 567, and one of its representative fragments; and 

35 ( d ) a Chlamydia trachomatis lipid A component-related polypeptide or one of its 
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representative fragments, said nucleotide sequences comprising a nucleotide sequence chosen from 
the following sequences: ORF 4, ORF 933, ORF 934, ORF 935, ORF 1185, and one of their 
representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
5 invention, characterized in that they encode a Chlamydia trachomatis Type III or other, non-Type in 
secreted polypeptides or one of its representative fragments, said nucleotide sequences comprising a 
nucleotide sequence chosen from the following sequences: ORF 180, ORF 181, ORF 207, ORF 208, 
ORF 372, ORF 391, ORF 399, ORF 477, ORF 486, ORF 749, ORF 758, ORF 819, ORF 878, ORF 
888, ORF 896, ORF 897, ORF 900, ORF 902, ORF 923, ORF 1015, ORF 1018, ORF 1059, ORF 
10 1060, ORF 1069, ORF 1071, ORF 1073, ORF 1076, ORF 1 189, and their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide containing RGD 
(Arg-Gly-Asp) attachment sites or one of its representative fragments: 

15 ( a ) RGD-containing proteins that are outer membrane proteins, are more likely to 

play a role in cell attachment. ORFs that encoded a protein containing an RGD sequence and also 
were classified as outer membrane proteins are ORF 488, ORF 489, ORF 571, ORF 572, ORF 573 or 
ORF 716, and its representative fragments. 

20 ( b ) The outer membrane of Chlamydia is made of cysteine-rich proteins that form 

a network of both intra and inter molecular disulfide links. This contributes to the integrity of the 
membrane since Chlamydia lacks the peptidoglycan layer that other gram-negative bacteria have. 
Cysteine-rich proteins that have the RGD sequence are also considered to be potential vaccine 
candidates. Cysteine-rich proteins were defined as proteins that had more than 3.0% cysteine in their 

25 primary amino acid sequence, above the mean genomic ORF cysteine content. The corresponding 
ORF is: ORF 1 144 and one of its representative fragments. 

(c) The outer membrane of Chlamydia may also contain small proteins that have 
cysteines in their N- and C-terminus that may contribute to the network formed by disulfide linkages. 

30 These proteins may be anchored in the outer membrane via their N-terminus and may have their C- 
terminus exposed, which then can interact with the host cells. Alternatively, these proteins may be 
anchored in the outer membrane via both N-and C-terminus and may have regions in the middle that 
may be exposed which can in turn interact with the host cells. ORFs encoding polypeptides that 
contain cysteines in their first 30 amino acids and also contain an RGD sequence are: ORF 101, ORF 

35 122, ORF 308, ORF 488, ORF 489, ORF 571, ORF 572, ORF 573, ORF 651, ORF 679, ORF 680, 
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ORF 705, ORF 716, ORF 763, ORF 870, ORF 878, ORF 879, ORF 995, ORF 1028, ORF 1029, ORF 
1 176, and one of their representative fragments. 

(d) RGD-containing ORFs homologous to RGD-containing ORFs from 
5 Chlamydia pneumoniae are: 

ORF 28, ORF 101, ORF 125, ORF 155, ORF 156, ORF 286, ORF 571, ORF 572, ORF 573, ORF 
763, ORF 870, and one of their representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
10 invention, characterized in that they encode a Chlamydia trachomatis cell wall anchored surface 
polypeptide or one of its representative fragments, said nucleotide sequences comprising a nucleotide 
sequence chosen from the following sequences: ORF 662, ORF 681 , ORF 1 182, ORF 1 192, and their 
representative fragments. 

Preferably, the invention relates to the nucleotide sequences according to the 
15 invention, characterized in that they encode Chlamydia trachomatis polypeptides not found in 
Chlamydia pneumoniae (Blastp P>e" 10 ), said nucleotide sequences comprising a nucleotide sequence 
chosen from the following sequences: ORF 2, ORF 18, ORF 60, ORF 66, ORF 67, ORF 68, ORF 69, 
ORF 70, ORF 81, ORF 89, ORF 107, ORF 108, ORF 109, ORF 134, ORF 147, ORF 191, ORF 194, 
ORF 216, ORF 217, ORF 218, ORF 219, ORF 220, ORF 221, ORF 222, ORF 223, ORF 224, ORF 

20 225, ORF 228, ORF 235, ORF 257, ORF 276, ORF 277, ORF 278, ORF 279, ORF 280. ORF 281, 
ORF 282, ORF 283, ORF 284, ORF 285, ORF 289, ORF 291, ORF 298, ORF 313, ORF 314, ORF 
315, ORF 316, ORF 334, ORF 335, ORF 336, ORF 337, ORF 338, ORF 339, ORF 340, ORF 381, 
ORF 393, ORF 413, ORF 418, ORF 419, ORF 420, ORF 421, ORF 422, ORF 423, ORF 436, ORF 
460, ORF 475, ORF 476, ORF 480, ORF 485, ORF 487, ORF 491, ORF 492, ORF 493, ORF 494, 

25 ORF 496, ORF 500, ORF 504, ORF 514, ORF 527, ORF 559, ORF 569. ORF 570, ORF 575, ORF 
580, ORF 582, ORF 593, ORF 598, ORF 632, ORF 640, ORF 651, ORF 671, ORF 690, ORF 694, 
ORF 698, ORF 710, ORF 722. ORF 723, ORF 724. ORF 770, ORF 771, ORF 782, ORF 783, ORF 
784, ORF 790, ORF 795, ORF 798, ORF 805, ORF 810, ORF 817, ORF 829, ORF 830, ORF 864, 
ORF 866, ORF 876, ORF 887, ORF 892, ORF 899, ORF 913, ORF 921, ORF 933, ORF 938, ORF 

30 949, ORF 956, ORF 1010, ORF 1017, ORF 1018, ORF 1027, ORF 1030, ORF 1037, ORF 1038, ORF 
1047, ORF 1072, ORF 1074, ORF 1075, ORF 1078, ORF 1079, ORF 1081, ORF 1083, ORF 1084, 
ORF 1087, ORF 1088, ORF 1089, ORF 1091, ORF 1092, ORF 1094, ORF 1095. ORF 1096, ORF 
1098, ORF 1104, ORF 1105, ORF 1106, ORF 1108, ORF 1110, ORF 1114, ORF 1115, ORF 1116, 
ORF 1117, ORF 1119, ORF 1128, ORF 1132, ORF 1133, ORF 1135, ORF 1136, ORF 1139, ORF 

35 1140, ORF 1141, ORF 1142, ORF 1144, ORF 1148, ORF 1151, ORF 1155, ORF 1157. ORF 1159, 
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ORF 1161, ORF 1162, ORF 1165, ORF 1166, ORF 1167, ORF 1168, ORF 1169, ORF 1171, ORF 
1172, ORF 1173, ORF 1174, ORF 1175, ORF 1176, ORF 1177, ORF 1178, ORF 1180, ORF 1181, 
ORF 1183, ORF 1184, ORF 1186, ORF 1187, ORF 1188, ORF 1192, ORF 1194, ORF 1197, and their 
representative fragments. 

5 Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the intermediate metabolism, in particular in the 

metabolism of sugars and/or of cofactors, such as for example triose phosphate isomerase or pyruvate 

kinase, and in that they comprise a nucleotide sequence chosen from the following sequences: 
10 ORF10; ORF44; ORF45; ORF46; ORF47; ORF93; ORF101; ORF102; ORF103; ORF106; ORF107; 

ORF120; ORF121; ORF130; ORF135; ORF140; ORF143; ORF144; ORF145; ORF158; ORF159; 

ORF160; ORF161; ORF192; ORF193; ORF196; ORF197; ORF198; ORF199; ORF227; ORF229; 

ORF236; ORF236; ORF239; ORF243; ORF245; ORF264; ORF265; ORF297; ORF331; ORF333; 

ORF359; ORF360; ORF374; ORF404; ORF405; ORF405; ORF410; ORF415; ORF415; ORF416; 
15 ORF417; ORF432; ORF460; ORF461; ORF462; ORF495; ORF513; ORF515; ORF566; ORF566; 

ORF566; ORF589; ORF613; ORF645; ORF646; ORF647; ORF652; ORF653; ORF654; ORF672; 

ORF673; ORF674; ORF682; ORF684; ORF692; ORF700; ORF725; ORF801; ORF802; ORF835; 

ORF836; ORF837; ORF860; ORF861; ORF862; ORF863; ORF869; ORF869; ORF925; ORF964; 

ORF983 and one of their representative fragments. 
20 Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the intermediate metabolism of nucleotides or nucleic 

acids, such as for example CTP synthetase or GMP synthetase, and in that they comprise a nucleotide 

sequence chosen from the following sequences: 
25 ORF142; ORF142; ORF169; ORF256; ORF268; ORF325; ORF352; ORF366; ORF435; ORF444; 

ORF528; ORF529; ORF530; ORF548; ORF549; ORF601; ORF602; ORF617; ORF619; ORF644; 

ORF745; ORF971 ; ORF972; ORF1023 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
30 representative fragments which is involved in the metabolism of nucleic acids, such as for example 

DNA polymerases or DNA topoisomerases, and in that they comprise a nucleotide sequence chosen 

from the following sequences: 

ORF5; ORF12; ORF82; ORF96; ORF97; ORF98; ORF99; ORF100; ORF105; ORF118; ORF136; 
ORF137; ORF163; ORF190; ORF204; ORF259; ORF260; ORF262; ORF290; ORF300; ORF301; 
35 ORF302; ORF387; ORF427; ORF434; ORF441; ORF444; ORF471; ORF595; ORF596; ORF597; 
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ORF599; ORF600; ORF605; ORF612; ORF624; ORF625; ORF650; ORF657; ORF658; ORF702; 

ORF703; ORF704; ORF708; ORF719; ORF766; ORF767; ORF775; ORF779; ORF787; ORF788; 

ORF794; ORF841; ORF842; ORF883; ORF884; ORF907; ORF918; ORF924; ORF928; ORF929; 

ORF962; ORF962; ORF963; ORF969; ORF970; ORF975; ORF979; ORF995; ORF1031; ORF1032 
5 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of amino acids or polypeptides, such as 

for example serine hydroxymethyl transferase or the proteins which load amino acids onto transfer 
1 0 RNAs, and in that they comprise a nucleotide sequence chosen from the following sequences: 

ORF27; ORF41; ORF55; ORF56; ORF57; ORF59; ORF62; ORF63; ORF64; ORF65; ORF119; 

ORF132; ORF240; ORF241; ORF277; ORF278; ORF279; ORF382; ORF406; ORF428; ORF442; 

ORF446; ORF447; ORF453; ORF454; ORF541; ORF542; ORF591; ORF608; ORF609; ORF610; 

ORF618; ORF648; ORF649; ORF660; ORF661; ORF677; ORF717; ORF765; ORF797; ORF871; 
15 ORF875; ORF920; ORF922; ORF937; ORF998; ORF1020; ORF1021; ORF1034; ORF1044; 

ORF1046; ORF1049 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of polypeptides, such as for example 
20 protein kinases or proteases, and in that they comprise a nucleotide sequence chosen from the 

following sequences: 

ORF21; ORF22; ORF23; ORF24; ORF25; ORF26; ORF75; ORF84; ORF86; ORF92; ORF133; 

ORF151; ORF152; ORF157; ORF179; ORF209; ORF307; ORF326; ORF343; ORF344; ORF345; 

ORF371; ORF429; ORF519; ORF557; ORF586; ORF587; ORF630; ORF656; ORF706; ORF707; 
25 ORF730; ORF751; ORF752; ORF786; ORF847; ORF885; ORF923; ORF978; ORF1039; ORF1048 

and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 

representative fragments which is involved in the metabolism of fatty acids, such as for example 
30 succinyl-CoA-synthesizing proteins or phosphatidylserine synthetase, and in that they comprise a 

nucleotide sequence chosen from the following sequences: 

ORF4; ORF15; ORF16; ORF141; ORF173; ORF205; ORF205; ORF206; ORF207; ORF208; 
ORF312; ORF355; ORF415; ORF550; ORF558; ORF560; ORF561; ORF574; ORF574; ORF577; 
ORF578; ORF590; ORF614; ORF772; ORF808; ORF809; ORF904; ORF905; ORF905; ORF933; 
35 ORF934; ORF934; ORF936 and one of their representative fragments. 
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Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or on of its 
representative fragments which is involved in the synthesis of the wall, such as for example 
KDO transferase, and the proteins responsible for the attachment of certain sugars onto the exposed 
5 proteins, and in that they comprise a nucleotide sequence chosen from the following sequences: 

ORF87; ORF196; ORF242; ORF269; ORF628; ORF629; ORF634; ORF635; ORF637; ORF638; 
ORF1019 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
10 representative fragments which is involved in the transcription, translation and/or maturation process, 
such as for example initiation factors, RNA polymerases or certain chaperone proteins, and in that 
they comprise a nucleotide sequence chosen from the following sequences: 

ORF112; ORF113; ORF332; ORF212; ORF213; ORF350; ORF362; ORF363; ORF364; ORF407; 

ORF451; ORF546; ORF643; ORF744; ORF746; ORF833; ORF868; ORF981; ORF982; ORF1003; 
15 ORF101 1 ; ORF1042 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 

invention, characterized in that they encode a Chlamydia trachomatis ribosomal polypeptide or one of 

its representative fragments, such as for example the ribosomal proteins L21, L27 and SI 0, and in that 

they comprise a nucleotide sequence chosen from the following sequences: 
20 ORF114; ORF115; ORF116; ORF328; ORF361; ORF375; ORF445; ORF543; ORF584; ORF585; 

ORF743; ORF813; ORF941; ORF942; ORF944; ORF946; ORF947; ORF948; ORF950; ORF951; 

ORF952; ORF953; ORF954; ORF955; ORF955; ORF957; ORF958; ORF960; ORF961; ORF1040; 

ORF1041; ORF1043; ORF1063; ORF1064 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
25 invention, characterized in that they encode a Chlamydia trachomatis transport polypeptide or one of 

its representative fragments, such as for example the proteins for transporting amino acids, sugars and 

certain oligopeptides, and in that they comprise a nucleotide sequence chosen from the following 

sequences: 

ORF6; ORF50; ORF51; ORF80; ORF125; ORF126; ORF128; ORF129; ORF215; ORF246; ORF248; 
30 ORF249; ORF251; ORF252; ORF253; ORF255; ORF273; ORF275; ORF293; ORF309; ORF323; 

ORF324; ORF398; ORF401; ORF449; ORF511; ORF512; ORF564; ORF565; ORF667; ORF679; 

ORF680; ORF711; ORF712; ORF713; ORF714; ORF715; ORF730; ORF731; ORF736; ORF737; 

ORF738; ORF870; ORF908; ORF919; ORF977; ORF987; ORF988; ORF992; ORF993; ORF994; 

ORF1028; ORF1029 and one of their representative fragments. 
35 Preferably, the invention also relates to the nucleotide sequences according to the 
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invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
representative fragments which is involved in the virulence process, such as for example the proteins 
analogous to the Escherichia coli vacB protein, and in that they comprise a nucleotide sequence 
chosen from the following sequences: 
5 ORF20; ORF815; ORF816; ORF898; ORF1059; ORF1060 and one of their representative fragments. 

Preferably, the invention also relates to the nucleotide sequences according to the 
invention, characterized in that they encode a Chlamydia trachomatis polypeptide or one of its 
representative fragments which is involved in the secretory system and/or which is secreted, such as 
for example proteins homologous to proteins in the secretory system of certain bacteria such as the 
10 Salmonellae or the Yersiniae, and in that they comprise a nucleotide sequence chosen from the 
following sequences: 

ORF758; ORF888; ORF889; ORF890; ORF891; ORF896; ORF897; ORF898 and one of their 
representative fragments. 

Preferably, the invention also relates to nucleotide sequences according to the 
15 invention, characterized in that they encode a polypeptide specific to Chlamydiae or one of its 
representative fragments, and in that they comprise a nucleotide sequence chosen from the following 
sequences: 

ORF22; ORF29; ORF31; ORF32; ORF34; ORF35; ORF39; ORF40; ORF43; ORF48; ORF49; 

ORF50; ORF52; ORF53; ORF54; ORF72; ORF77; ORF78; ORF87; ORF90; ORF95; ORF108; 
20 ORF110; ORFlll; ORF122; ORF123; ORF124; ORF127; ORF138; ORF144; ORF146; ORF153; 

ORF155; ORF164; ORF166; ORF175; ORF182; ORF184; ORF186; ORF187; ORF188; ORF202; 

ORF210; ORF247; ORF258; ORF266; ORF267; ORF270; ORF273; ORF274; ORF295; ORF296; 

ORF305; ORF306; ORF309; ORF318; ORF319; ORF322; ORF326; ORF342; ORF357; ORF376; 

ORF379; ORF380; ORF388; ORF390; ORF400; ORF431; ORF433; ORF438; ORF443; ORF456; 
25 ORF457; ORF458; ORF464; ORF468; ORF470; ORF473; ORF486; ORF489; ORF497; ORF501; 

ORF503; ORF504; ORF508; ORF512; ORF521; ORF522; ORF523; ORF524; ORF533; ORF535; 

ORF536; ORF537; ORF538; ORF539; ORF540; ORF554; ORF563; ORF572; ORF579; ORF595; 

ORF603; ORF604; ORF606; ORF607; ORF615; ORF616; ORF622; ORF641; ORF642; ORF659; 

ORF668; ORF670; ORF693; ORF695; ORF696; ORF699; ORF703; ORF704; ORF716; ORF726; 
30 ORF728; ORF739; ORF742; ORF747; ORF750; ORF751; ORF755; ORF757; ORF759; ORF761; 

ORF762; ORF763; ORF764; ORF773; ORF780; ORF781; ORF789; ORF800; ORF803; ORF804; 

ORF818; ORF820; ORF822; ORF823; ORF824; ORF827; ORF828; ORF839; ORF849; ORF850; 

ORF851; ORF852; ORF855; ORF856; ORF857; ORF858; ORF859; ORF860; ORF861; ORF862; 

ORF863; ORF865; ORF868; ORF869; ORF870; ORF871; ORF872; ORF873; ORF874; ORF875; 
35 ORF877; ORF878; ORF880; ORF882; ORF884; ORF886; ORF893; ORF901; ORF906; ORF910; 
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ORF912; ORF915; ORF916; ORF917; ORF926; ORF929; ORF933; ORF965; ORF967; ORF968; 
ORF984; ORF986; ORF989; ORF990; ORF996; ORF997; ORF1001; ORF1002; ORF1013; 
ORF1016; ORF1031; ORF1033; ORF1035; ORF1049; ORF1051; ORF1052; ORF1054; ORF1056; 
ORF1057; ORF1058; ORF1062; ORF1070; ORF1071; ORF1073 and one of their representative 
5 fragments. 

Also forming part of the invention are polypeptides encoded by the polynucleotides 
of the invention, as well as fusion polypeptides comprising such polypeptides. In one embodiment, 
the polypeptides and fusion polypeptides immunoreact with seropositive serum of an individual 
infected with Chlamydia trachomatis. For example, described below, are polypeptide sequences 

10 exhibiting particularly preferable characteristics. For each group of preferred polypeptides described 
below, it is to be understood that in addition to the individual polypeptides listed, in instances wherein 
such polypeptides are encoded as part of «combined» ORFs, such «combined» polypeptides are also 
to be included within the preferred group. 

The subject of the invention is also a polypeptide according to the invention, 

15 characterized in that it is a polypeptide of the cellular envelope, preferably of the outer cellular 
envelope, of Chlamydia trachomatis or one of its representative fragments. According to the 
invention, the said polypeptide is preferably chosen from the polypeptides having the following 
sequences: 

SEQ ID No. 3; SEQ ID No. 19; SEQ ID No. 51; SEQ ID No. 189; SEQ ID No. 212; SEQ ID No. 213; 
20 SEQ ID No. 324; SEQ ID No. 477; SEQ ID No. 478; SEQ ID No. 479; SEQ ID No. 481; 

SEQ ID No. 482; SEQ ID No. 483; SEQ ID No. 484; SEQ ID No. 486; SEQ ID No. 488; 

SEQ ED No. 489; SEQ ID No. 490; SEQ ID No. 572; SEQ ID No. 573; SEQ ID No, 742; 

SEQ ID No. 817; SEQ ID No. 818; SEQ ID No. 820; SEQ ID No. 1035; SEQ ID No. 1036; 

SEQ ID No. 1037; SEQ ID No. 1038; SEQ ID No. 1070; SEQ ID No. 1071; SEQ ID No. 1073 and 
25 one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 

characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 

representative fragments, having between 1 and 3 transmembrane domains, and in that it is chosen 

from the polypeptides having the following sequences: 
30 SEQ ID No. 2; SEQ ID No. 3; SEQ ID No. 5; SEQ ID No. 8; SEQ ID No. 9; SEQ ID No. 10; 

SEQ ID No. 11; SEQ ID No. 12; SEQ ID No. 17; SEQ ID No. 21; SEQ ID No. 26; SEQ ID No. 27; 

SEQ ID No. 28; SEQ ID No. 29; SEQ ID No. 30; SEQ ID No. 31; SEQ ID No. 33; SEQ ID No. 35; 

SEQ ID No. 37; SEQ ID No. 39; SEQ ED No. 40; SEQ ID No. 41; SEQ ID No. 42; SEQ ID No. 43; 

SEQ ID No. 44; SEQ ID No. 45; SEQ ID No. 46; SEQ ID No. 47; SEQ ID No. 48; SEQ ID No. 49; 
35 SEQ ID No. 52; SEQ ID No. 53; SEQ ID No. 55; SEQ ID No. 56; SEQ ID No. 58; SEQ ID No. 65; 
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SEQIDNo.66; SEQIDNo.68; SEQ ID No. 70; SEQIDNo.74; SEQIDNo.75; SEQIDNo.76 
SEQIDNo.78; SEQIDNo.79; SEQ ID No. 81; SEQIDNo.82; SEQ ID No. 83; SEQIDNo.86 
SEQ ID No. 91; SEQ ID No. 92; SEQ ID No. 94; SEQ ID No. 97; SEQ ID No. 100; SEQTD No. 102 



SEQ ID No. 103; 
5 SEQ ID No. 110; 
SEQ ID No. 115; 
SEQ ID No. 123; 
SEQ ID No. 140; 
SEQ ID No. 147; 
10 SEQ ID No. 155; 
SEQ ID No. 165; 
SEQ ID No. 170; 
SEQ ID No. 177; 
SEQ ID No. 187; 
15 SEQ ID No. 194; 
SEQ ID No. 199; 
SEQ ID No. 207; 
SEQ ID No. 219; 
SEQ ID No. 224; 
20 SEQ ID No. 232; 
SEQ ID No. 244; 
SEQ ID No. 252; 
SEQ ID No. 263; 
SEQ ID No. 271; 
25 SEQ ID No. 278; 
SEQ ID No. 285; 
SEQ ID No. 294; 
SEQ ID No. 311; 
SEQ ID No. 320; 
30 SEQ ID No. 327; 
SEQ ID No. 333; 
SEQ ID No. 339; 
SEQ ID No. 348; 
SEQ ID No. 353; 
35 SEQ ID No. 362; 



SEQ ID No. 105 
SEQ ID No. Ill 
SEQ ID No. 116 
SEQ ID No. 130 
SEQ ID No. 141 
SEQ ID No. 148 
SEQ ID No. 156 
SEQ ID No. 166 
SEQ ID No. 171 
SEQ ID No. 181 
SEQ ID No. 188 
SEQ ID No. 195 
SEQ ID No. 201 
SEQ ID No. 209 
SEQ ID No. 220 
SEQ ID No. 225 
SEQ ID No. 234: 
SEQ ID No. 245 
SEQ ID No. 254 
SEQ ID No. 265 
SEQ ID No. 272 
SEQ ID No. 279 
SEQ ID No. 287 
SEQ ID No. 298 
SEQ ID No. 313 
SEQ ID No. 322 
SEQ ID No. 328 
SEQ ID No. 334 
SEQ ID No. 340 
SEQ ID No. 349 
SEQ ID No. 356 
SEQ ID No. 366 



SEQ ID No. 106; 

SEQ ID No. 112; 

SEQ ID No. 117; 

SEQ ID No. 134; 

SEQ ID No. 143; 

SEQ ID No. 149; 

SEQ ID No. 162; 

SEQ ID No. 167; 

SEQ ID No. 173; 

SEQ ID No. 183; 

SEQ ID No. 190; 

SEQ ID No. 196; 

SEQ ID No. 202; 

SEQ ID No. 212; 

SEQ ID No. 221; 

SEQ ID No. 227; 
SEQ ID No. 236; 
SEQ ID No. 247; 
SEQ ID No. 257; 
SEQ ID No. 266; 
SEQ ID No. 274; 
SEQ ID No. 282; 
SEQ ID No. 289; 
SEQ ID No. 305; 
SEQ ID No. 315; 
SEQ ID No. 323; 
SEQ ID No. 330; 
SEQ ID No. 335; 
SEQ ID No. 341; 
SEQ ID No. 350; 
SEQ ID No. 357; 
SEQ ID No. 367; 



SEQ ID No. 107 
SEQ ID No. 113 
SEQ ID No. 120 
SEQIDNo. 135 
SEQ ID No. 144 
SEQ ID No. 150 
SEQ ID No. 163 
SEQIDNo. 168 
SEQIDNo. 175 
SEQIDNo. 184 
SEQIDNo. 191 
SEQ ID No. 197 
SEQ ID No. 204 
SEQIDNo. 213 
SEQ ID No. 222 
SEQ ID No. 228 
SEQ ID No. 237 
SEQ ID No. 248 
SEQ ID No. 260 
SEQ ID No. 267 
SEQ ID No. 276 
SEQIDNo. 283 
SEQ ID No. 290 
SEQ ID No. 306 
SEQ ID No. 316 
SEQ ID No. 325 
SEQIDNo. 331 
SEQ ID No. 336 
SEQ ID No. 344; 
SEQ ID No. 351 
SEQ ID No. 358 
SEQ ID No. 368 



SEQIDNo. 109 
SEQIDNo. 114 
SEQIDNo. 122 
SEQIDNo. 137 
SEQIDNo. 145 
SEQIDNo. 151 
SEQIDNo. .164 
SEQIDNo. 169 
SEQIDNo. 176 
SEQIDNo. 186 
SEQ ID No. 192 
SEQIDNo. 198 
SEQ ID No. 206 
SEQ ID No. 217 
SEQ ID No. 223 
SEQ ID No. 231 
SEQ ID No. 243 
SEQ ID No. 249 
SEQ ID No. 261 
SEQ ID No. 270 
SEQ ID No. 277 
SEQIDNo. 284 
SEQ ID No. 291 
SEQ ID No. 310; 
SEQ ID No. 319 
SEQ ID No. 326 
SEQ ID No. 332 
SEQ ID No. 338 
SEQ ID No. 345 
SEQ ID No. 352 
SEQIDNo. 361 
SEQ ID No. 370 
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SEQ ID No. 372 
SEQ ID No. 379 
SEQ ID No. 385 
SEQ ID No. 393 
5 SEQ ID No. 404 
SEQ ID No. 417 
SEQ ID No. 426 
SEQ ED No. 434 
SEQ ED No. 443: 
10 SEQ ED No. 454: 
SEQ ID No. 463; 
SEQ ID No. 469; 
SEQ ID No. 476; 

SEQ ED No. 483; 
15 SEQ ID No. 488; 

SEQ ED No. 498; 

SEQ ED No. 508; 

SEQ ED No. 521; 

SEQ ED No. 529; 
20 SEQ ED No. 536; 

SEQ ED No. 542; 

SEQ ED No. 547; 

SEQ ID No. 558; 

SEQ ED No. 566; 
25 SEQ ED No. 572; 

SEQ ED No. 582; 

SEQ ED No. 593; 

SEQ ED No. 601; 

SEQ ED No. 609; 
30 SEQ ED No. 617; 

SEQ ED No. 624; 

SEQ ID No. 634, 

SEQ ED No. 641; 

SEQ ID No. 651; 
35 SEQ ID No. 658; 



SEQ ED No. 373 
SEQ ID No. 380 
SEQ ED No. 387 
SEQ ID No. 396 
SEQ ED No. 406; 
SEQ ED No. 418; 
SEQ ED No. 427; 
SEQ ED No. 435; 
SEQ ID No. 446; 
SEQ ID No. 455; 
SEQ ED No. 464; 
SEQ ED No. 470; 
SEQ ED No. 477; 
SEQ ED No. 484; 
SEQ ED No. 491; 
SEQ ID No. 500; 
SEQ ID No. 5 12; 
SEQ ED No. 523; 
SEQ ED No. 530; 
SEQ ED No. 537; 
SEQ ED No. 543; 
SEQ ED No. 551; 
SEQ ED No. 559; 
SEQ ED No. 567; 
SEQ ED No. 574; 
SEQ ED No. 585; 
SEQ ED No. 595; 
SEQ ED No. 602; 
SEQ ED No. 610; 
SEQ ID No. 618; 
SEQ ED No. 625; 
SEQ ED No. 635; 
SEQ ED No. 643; 
SEQ ID No. 652; 
SEO ED No. 664: 



35 

SEQ ID No. 375; 
SEQ ED No. 382; 
SEQ ED No. 389; 
SEQ ED No. 398; 
SEQ ID No. 407; 
SEQ ED No. 420; 
SEQ ED No. 428; 
SEQ ID No. 436; 
SEQ ED No. 448; 
SEQ ED No. 457; 
SEQ ED No. 466; 
SEQ ID No. 473; 
SEQ ED No. 479; 
SEQ ED No. 485; 
SEQ ED No. 493; 
SEQ ID No. 501; 
SEQ ED No. 513; 
SEQ ED No. 524; 
SEQ ED No. 531; 
SEQ ED No. 538; 
SEQ ED No. 544; 
SEQ ED No. 552; 
SEQ ED No. 560; 
SEQ ID No. 568; 
SEQ ED No. 575; 
SEQ ID No. 587; 
SEQ ED No. 596; 
SEQ ED No. 603; 
SEQ ED No. 611; 
SEQ ED No. 621; 
SEQ ED No. 628; 
SEQ ED No. 637; 
SEQ ED No. 646; 
SEQ ED No. 653; 
SEQ ED No. 665; 



SEQ ED No. 377 
SEQ ED No. 383: 
SEQ ED No. 390; 
SEQ ID No. 399; 
SEQ ED No. 413; 
SEQ ED No. 421; 
SEQ ED No. 430; 
SEQ ED No. 437; 
SEQ ED No. 450; 
SEQ ID No. 458; 
SEQ ID No. 467; 
SEQ ED No. 474; 
SEQ ED No. 480; 
SEQ ED No. 486; 
SEQ ED No. 496; 
SEQ ED No. 503; 
SEQ ID No. 514; 
SEQ ED No. 526; 
SEQ ED No. 532; 
SEQ ED No. 540; 
SEQ ED No. 545; 
SEQ ID No. 553; 
SEQ ED No. 561; 
SEQ ED No. 569; 
SEQ ED No. 576; 
SEQ ID No. 589; 
SEQ ED No. 597; 
SEQ ED No. 604; 
SEQ ED No. 615; 
SEQ ED No. 622; 
SEQ ED No. 632; 
SEQ ED No. 638; 
SEQ ED No. 648; 
SEQ ED No. 654; 
SEQ ED No. 666; 



SEQ ED No. 378 
SEQ ED No. 384; 
SEQ ID No. 391; 
SEQ ED No. 403; 
SEQ ED No. 414; 
SEQ ED No. 424; 
SEQ ED No. 433; 
SEQ ED No. 440; 
SEQ ID No. 451; 
SEQ ID No. 459; 
SEQ ED No. 468; 
SEQ ED No. 475; 
SEQ ED No. 481; 
SEQ ID No. 487; 
SEQ ED No. 497; 
SEQ ED No. 504; 
SEQ ID No. 519; 
SEQ ID No. 527; 
SEQ ED No. 534; 
SEQ ED No. 541; 
SEQ ID No. 546; 
SEQ ED No. 555; 
SEQ ED No. 562; 
SEQ ED No. 571; 
SEQ ED No. 580; 
SEQ ED No. 592; 
SEQ ED No. 599; 
SEQ ED No. 608; 
SEQ ED No. 616; 
SEQ ED No. 623; 
SEQ ED No. 633; 
SEQ ED No. 640; 
SEQ ED No. 649; 
SEQ ED No. 655; 
SEQ ID No. 668; 
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SEQ ID No. 669 

SEQ ID No. 674 

SEQ ID No. 682 

SEQ ID No. 689 
5 SEQ ID No. 695 

SEQ ID No. 704 

SEQ ID No. 7 10 

SEQ ID No. 715 

SEQ ED No. 722: 
10 SEQ ID No. 730; 

SEQ ID No. 737; 

SEQ ID No. 743; 

SEQ ID No. 750; 

SEQ ID No. 755; 
15 SEQ ID No. 764; 

SEQ ID No. 772; 

SEQ ID No. 777; 

SEQ ED No. 782; 

SEQ ID No. 789; 
20 SEQ ID No. 802; 

SEQ ID No. 810; 

SEQ ED No. 820; 

SEQ ED No. 828; 

SEQ ED No. 835; 
25 SEQ ED No. 841; 

SEQ ED No. 849; 

SEQ ID No. 855; 

SEQ ID No. 862; 

SEQ ID No. 872; 
30 SEQ ED No. 880; 

SEQ ED No. 886; 

SEQ ED No. 895; 

SEQ ID No. 904; 

SEQ ED No. 914; 
35 SEQ ED No. 923; 



SEQ ED No. 670; 
SEQ ED No. 676; 
SEQ ED No. 683; 
SEQ ED No. 690; 
SEQ ED No. 696; 
SEQ ED No. 705; 
SEQ ID No. 711; 
SEQ ED No. 717; 
SEQ ED No. 724; 
SEQ ED No. 731; 
SEQ ID No. 738; 
SEQ ED No. 744; 
SEQ ED No. 751; 
SEQ ID No. 757; 
SEQ ED No. 766; 
SEQ ED No. 773; 
SEQ ED No. 778; 
SEQ ID No. 783; 
SEQ ED No. 790; 
SEQ ED No. 803; 
SEQ ED No. 811; 
SEQ ID No. 822; 
SEQ ED No. 829; 
SEQ ED No. 837; 
SEQ ID No. 842; 
SEQ ED No. 850; 
SEQ ED No. 856; 
SEQ ED No. 863; 
SEQ ED No. 873; 
SEQ ED No. 881; 
SEQ ED No. 887; 
SEQ ED No. 897; 
SEQ ED No. 906; 
SEQ ED No. 917; 
SEOEDNo. 924: 
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SEQ ED No. 671; 
SEQ ED No. 677; 
SEQ ED No. 684; 
SEQ ED No. 691; 
SEQ ED No. 698; 
SEQ ED No. 706; 
SEQ ED No. 712; 
SEQ ED No. 718; 
SEQ ED No. 726; 
SEQ ED No. 732; 
SEQ ED No. 739; 
SEQ ED No. 745; 
SEQ ED No. 752; 
SEQ ED No. 758; 
SEQ ED No. 768; 
SEQ ID No. 774; 
SEQ ED No. 779; 
SEQ ED No. 786; 
SEQ ED No. 793; 
SEQ ED No. 806; 
SEQ ID No. 813; 
SEQ ED No. 824; 
SEQ ED No. 830; 
SEQ ED No. 838; 
SEQ ED No. 843; 
SEQ ED No. 851; 
SEQ ED No. 857; 
SEQ ED No. 864; 
SEQ ED No. 874; 
SEQ ED No. 883; 
SEQ ED No. 892; 
SEQ ED No. 899; 
SEQ ED No. 909; 
SEQ ED No. 920; 
SEQ ED No. 925; 



SEQ ED No. 672; 
SEQ ED No. 678; 
SEQ ED No. 686; 
SEQ ED No. 692; 
SEQ ED No. 701; 
SEQ ID No. 707; 
SEQ ED No. 713; 
SEQ ED No. 720; 
SEQ ED No. 728; 
SEQ ED No. 733; 
SEQ ED No. 740; 
SEQ ED No. 746; 
SEQ ED No. 753; 
SEQ ED No. 759; 
SEQ ED No. 769; 
SEQ ED No. 775; 
SEQ ID No. 780; 
SEQ ED No. 787; 
SEQ ED No. 798; 
SEQ ID No. 808; 
SEQ ED No. 814; 
SEQ ED No. 825; 
SEQ ED No. 833; 
SEQ ED No. 839; 
SEQ ED No. 845; 
SEQ ED No. 852; 
SEQ ID No. 859; 
SEQ ED No. 866; 
SEQ ED No. 878; 
SEQ ED No. 884; 
SEQ ED No. 893; 
SEQ ED No. 900; 
SEQ ED No. 910; 
SEQ ID No. 921; 
SEQ ED No. 926; 



SEQ ID No. 673; 
SEQ ID No. 680; 
SEQ ED No. 688; 
SEQ ED No. 693; 
SEQ ED No. 703; 
SEQ ED No. 709; 
SEQ ED No. 714; 
SEQ ED No. 721; 
SEQ ED No. 729; 
SEQ ED No. 734; 
SEQ ED No. 742; 
SEQ ID No. 748; 
SEQ ID No. 754; 
SEQ ED No. 760; 
SEQ ED No. 771; 
SEQ ID No. 776; 
SEQ ED No. 781; 
SEQ ED No. 788; 
SEQ ID No. 800; 
SEQ ED No. 809; 
SEQ ED No. 817; 
SEQ ED No. 827; 
SEQ ED No. 834; 
SEQ ED No. 840; 
SEQ ED No. 848; 
SEQ ID No. 854; 
SEQ ED No. 860; 
SEQ ED No. 869; 
SEQ ED No. 879; 
SEQ ED No. 885; 
SEQ ED No. 894; 
SEQ ID No. 901; 
SEQ ID No. 912; 
SEQ ED No. 922; 
SEQ ID No. 927; 
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SEQ ID No. 930; 
SEQ ID No. 937; 
SEQ ID No. 944; 
SEQ ID No. 952; 
SEQ ID No. 957; 
SEQ ID No. 963; 
SEQ ID No. 970; 
SEQ ID No. 980; 
SEQ ID No. 989; 



SEQ ID No. 933; 
SEQ ID No. 940; 
SEQ ID No. 945; 
SEQ ID No. 953; 
SEQ ID No. 958; 
SEQ ID No. 964; 
SEQ ID No. 971; 
SEQ ID No. 981; 
SEQ ID No. 990; 



37 

SEQ ID No. 934; 
SEQ ID No. 941; 
SEQ ID No. 947; 
SEQ ID No. 954; 
SEQ ID No. 960; 
SEQ ID No. 966; 
SEQ ID No. 973; 
SEQ ID No. 982; 



SEQ ID No. 935; 
SEQ ID No. 942; 
SEQ ID No. 948; 
SEQ ED No. 955; 
SEQ ID No. 961; 
SEQ ID No. 967; 
SEQ ID No. 974; 
SEQ ID No. 984; 
SEQ ID No. 995; 



SEQ ID No. 936 
SEQ ID No. 943 
SEQ ID No. 951 
SEQ ID No. 956 
SEQ ED No. 962 
SEQ ED No. 969 
SEQ ED No. 979 
SEQ ID No. 988 
SEQ ED No. 996 



SEQ ID No. 991; 

10 SEQ ID No. 999; SEQ ID No. 1001; SEQ ED No. 1003; SEQ ED No. 1004; SEQ ID No. 1005 
SEQ ED No. 1006; SEQ ED No. 1007; SEQ ED No. 1009; SEQ ED No. 1010; SEQ ID No. 101 1 
SEQ ED No. 1012; SEQ ED No. 1013; SEQ ED No. 1014; SEQ ID No. 1016; SEQ ED No. 1017 
SEQ ED No. 1018; SEQ ED No. 1020; SEQ ID No. 1021; SEQ ED No. 1025; SEQ ED No. 1026 
SEQ ID No. 1027; SEQ ED No. 1029; SEQ ED No. 1030; SEQ ED No. 103 1 ; SEQ ED No. 1035 

15 SEQ ED No. 1036; SEQ ED No. 1037; SEQ ED No. 1038; SEQ ED No. 1039; SEQ ED No. 1040 
SEQ ED No. 1044; SEQ ED No. 1045; SEQ ED No. 1047; SEQ ID No. 1048; SEQ ED No. 1050 
SEQ ED No. 1051; SEQ ED No. 1052; SEQ ED No. 1053; SEQ ED No. 1055; SEQ ED No. 1056 
SEQ ID No. 1057; SEQ ED No. 1058; SEQ ED No. 1061; SEQ ED No. 1062; SEQ ED No. 1063 
SEQ ED No. 1064; SEQ ED No. 1065; SEQ ED No. 1066; SEQ ED No. 1068; SEQ ED No. 1069 

20 SEQ ED No. 1072; SEQ ED No. 1074; SEQ ID No. 1 076 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 
representative fragments, having between 4 and 6 transmembrane domains, and in that it is chosen 
from the polypeptides having the following sequences: 

25 SEQ ED No. 7; SEQ ED No. 14; SEQ ED No. 16; SEQ ED No. 32; SEQ ID No. 34; SEQ ED No. 36 
SEQ ED No. 38; SEQ ED No. 50; SEQ ED No. 57; SEQ ED No. 59; SEQ ED No. 61 ; SEQ ED No. 62 
SEQ ED No. 63; SEQ ED No. 64; SEQ ED No. 67; SEQ ED No. 69; SEQ ED No. 72; SEQ ED No. 77 
SEQ ED No. 80; SEQ ID No. 84; SEQ ED No. 87; SEQ ED No. 93; SEQ ED No. 95; SEQ ED No. 99 



SEQ ED No. 108; 
30 SEQ ED No. 131; 

SEQ ED No. 154; 

SEQ ED No. 182; 

SEQ ED No. 239; 

SEQ ED No. 259; 
35 SEQ ED No. 286; 



SEQ ED No. 119 
SEQ ED No. 136 
SEQ ED No. 160 
SEQ ED No. 185 
SEQ ED No. 242 
SEQ ED No. 262 
SEQ ID No. 288 



SEQ ED No. 125 
SEQ ED No. 139 
SEQ ED No. 161 
SEQ ED No. 200 
SEQ ED No. 250 
SEQ ID No. 268 
SEQ ED No. 292 



SEQ ID No. 126; 
SEQ ED No. 146; 
SEQ ID No. 172; 
SEQ ID No. 203; 
SEQ ID No. 253; 
SEQ ID No. 275; 
SEQ ID No. 295; 



SEQ ED No. 129 
SEQ ED No. 152 
SEQ ED No. 179 
SEQ ID No. 205 
SEQ ED No. 256 
SEQ ED No. 281 
SEQ ED No. 296 
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SEQIDNo.297; SEQIDNo.299; SEQIDNo.300; SEQIDNo.308; SEQEDNo.314 

SEQ ED No. 317; SEQIDNo.318; SEQIDNo.324; SEQ ED No. 342; SEQIDNo.343 

SEQ ID No. 355; SEQ ID No. 360; SEQ ED No. 374; SEQIDNo.376; SEQ ED No. 386 

SEQIDNo.388; SEQ ED No. 392; SEQIDNo.394; SEQ ED No. 395; SEQ ID No. 402 

5 SEQIDNo.405; SEQ ED No. 411; SEQIDNo.415; SEQIDNo.416; SEQIDNo.422 

SEQIDNo.423; SEQ ED No. 429; SEQIDNo.432; SEQIDNo.441; SEQ ID No. 442 

SEQ ID No. 444; SEQIDNo.449; SEQ ED No. 452; SEQIDNo.456; SEQIDNo.460 

SEQIDNo.461; SEQIDNo.465; SEQIDNo.471; SEQIDNo.472; SEQIDNo.482 

SEQIDNo.489; SEQIDNo.492; SEQ ED No. 494; SEQIDNo.495; SEQ ED No. 502 

10 SEQIDNo.505; SEQ ID No. 506; SEQIDNo.509; SEQIDNo.516; SEQIDNo.517 

SEQIDNo.520; SEQIDNo.525; SEQIDNo.533; SEQIDNo.539; SEQ ID No. 549 

SEQIDNo.554; SEQIDNo.557; SEQ ED No. 563; SEQ ID No. 570; SEQ ED No. 573 

SEQIDNo.581; SEQ ID No. 590; SEQIDNo.591; SEQ ID No. 600; SEQIDNo.607 

SEQIDNo.612; SEQ ID No. 613; SEQ ID No. 620; SEQ ID No. 626; SEQ ID No. 629 

15 SEQ ID No. 630; SEQ ED No. 639; SEQ ID No. 644; SEQ ED No. 647; SEQ ED No. 656 

SEQ ID No. 659; SEQ ED No. 661; SEQ ED No. 685; SEQ ED No. 687; SEQ ED No. 699 

SEQ ED No. 700; SEQ ID No. 708; SEQ ED No. 716; SEQ ED No. 719; SEQ ED No. 725 

SEQ ID No. 747; SEQ ED No. 749; SEQ ED No. 756; SEQ ED No. 765; SEQ ID No. 767 

SEQ ED No. 794; SEQ ED No. 796; SEQ ED No. 797; SEQ ED No. 799; SEQ ED No. 801 

20 SEQ ED No. 807; SEQ ED No. 821; SEQ ED No. 823; SEQ ED No. 826; SEQ ED No. 847 

SEQ ED No. 853; SEQ ID No. 861; SEQ ED No. 870; SEQ ED No. 871; SEQ ID No. 875 

SEQ ID No. 882; SEQ ED No. 888; SEQ ID No. 889; SEQ ED No. 898; SEQ ED No. 902 

SEQ ED No. 903; SEQ ED No. 911; SEQ ED No. 916; SEQ ED No. 931; SEQ ED No. 939 

SEQ ED No. 975; SEQ ID No. 976; SEQ ED No. 978; SEQ ED No. 983; SEQ ED No. 986 

25 SEQ ED No. 987; SEQ ED No. 992; SEQ ED No. 993; SEQ ED No. 1000; SEQ ID No. 1002 

SEQ ED No. 1008; SEQ ED No. 1019; SEQ ED No. 1022; SEQ ID No. 1032; SEQ ED No. 1034; 
SEQ ID No. 1046; SEQ ID No. 1054; SEQ ED No. 1 060; SEQ ED No. 1071 and one of their 
representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
30 characterized in that it is a Chlamydia trachomatis transmembrane polypeptide or one of its 
representative fragments, having at least 7 transmembrane domains, and in that it is chosen from the 
polypeptides having the following sequences: SEQ ED No. 4; SEQ ED No. 6; SEQ ED No. 13 
SEQ ID No. 20; SEQ ED No. 51; SEQ ID No. 71; SEQ ED No. 88; SEQ ID No. 118; SEQ ED No. 128 

SEQ ED No. 132; SEQ ED No. 133; SEQ ED No. 158; SEQ ED No. 159; SEQ ED No. 174 

35 SEQ ID No. 180; SEQ ED No. 189; SEQ ED No. 210; SEQ ED No. 211; SEQ ID No. 214 
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SEQIDNo.215; SEQIDNo.226; SEQIDNo.229; SEQIDNo.233; SEQIDNo.235 
SEQIDNo.240; SEQIDNo.246; SEQIDNo.251; SEQIDNo.255; SEQIDNo.273 
SEQIDNo.354; SEQDDNo.364; SEQIDNo.369; SEQJDNo.371; SEQIDNo. 397 
SEQIDNo.401; SEQIDNo.409; SEQIDNo.412; SEQIDNo.419; SEQ ID No. 439 
5 SEQIDNo.453; SEQ ID No. 462; SEQIDNo.490; SEQ ID No. 510; SEQ ID No. 511 
SEQIDNo.518; SEQIDNo.535; SEQIDNo.548; SEQ ID No. 550; SEQIDNo.564 
SEQIDNo.565; SEQIDNo.578; SEQIDNo.579; SEQIDNo.614; SEQIDNo.631 
SEQIDNo.636; SEQIDNo.650; SEQ ID No. 662; SEQIDNo.667; SEQIDNo.679 
SEQIDNo.681; SEQIDNo.702; SEQIDNo.727; SEQIDNo.741; SEQIDNo.763 

10 SEQIDNo.791; SEQIDNo.792; SEQIDNo.815; SEQIDNo.816; SEQ ID No. 832 
SEQIDNo.846; SEQIDNo.858; SEQIDNo.865; SEQIDNo.867; SEQIDNo.868 
SEQIDNo.877; SEQIDNo.891; SEQlDNo.896; SEQIDNo.907; SEQIDNo.908 
SEQIDNo.918; SEQIDNo.919; SEQIDNo.932; SEQIDNo.959; SEQIDNo.977 
SEQIDNo.994; SEQIDNo.998; SEQ ID No. 1024; SEQIDNo. 1028; SEQIDNo. 1042 

15 SEQ ID No. 1067; SEQ ID No. 1070; SEQ ID No. 1073 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis surface exposed polypeptide or one of its representative fragments, and in 
that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 2, SEQ ID No. 3, SEQ ID No. 21, SEQ ID No. 22. SEQ ID No. 23, SEQ ID No. 53, SEQ 

20 ID No. 77, SEQ ID No. 187, SEQ ID No. 203, SEQ ID No. 383, SEQ ID No. 477, SEQ ID No. 478, 
SEQ ID No. 479, SEQ ID No. 481, SEQ ID No. 482, SEQ ID No. 483, SEQ ID No. 484, SEQ ID No. 
485, SEQ ID No. 486, SEQ ID No. 487, SEQ ID No. 488, SEQ ID No. 489, SEQ ID No. 490, SEQ ID 
No. 571, SEQ ID No. 572, SEQ ID No. 573, SEQ ID No. 593, SEQ ID No. 670, SEQ ID No. 693, 
SEQ ID No. 742, SEQ ID No. 749, SEQ ID No. 801, SEQ ID No. 817, SEQ ID No. 818, SEQ ID No. 

25 819, SEQ ID No. 820, SEQ ID No. 851, SEQ ID No. 902, SEQ ID No. 923, SEQ ID No. 1035, SEQ 
ID No. 1036, SEQ ID No. 1037, SEQ ID No. 1038, SEQ ID No. 1069, SEQ ID No. 1070, SEQ ID No. 
1071, SEQ ID No. 1073, SEQ ID No. 1076, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1141, 
SEQ ID No. 1181, and their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 

30 characterized in that it is a Chlamydia trachomatis lipoprotein or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 
SEQ ID No. 29, SEQ ID No. 42, SEQ ID No. 66, SEQ ID No. 72, SEQ ID No. 76, SEQ ID No. 78, 
SEQ ID No. 148, SEQ ID No. 154, SEQ ID No. 180, SEQ ID No. 182, SEQ ID No. 184, SEQ ID No. 
187, SEQ ID No. 200, SEQ ID No. 242, SEQ ID No. 245, SEQ ID No. 250, SEQ ID No. 253, SEQ ID 

35 No. 272, SEQ ID No. 274, SEQ ID No. 275, SEQ ID No. 308, SEQ ID No. 350, SEQ ID No. 362, 
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SEQ ID No. 383, SEQ ID No. 394, SEQ ID No. 396, SEQ ID No. 399, SEQ ID No. 422, SEQ ID No. 
488, SEQ ID No. 535. SEQ ID No. 568, SEQ ID No. 573, SEQ ID No. 578, SEQ ID No. 593, SEQ ID 
No. 607, SEQ ID No. 625, SEQ ID No. 662, SEQ ID No. 669. SEQ ID No. 688, SEQ' ID No. 690, 
SEQ ED No. 716. SEQ ID No. 773, SEQ ID No. 778, SEQ ID No. 781, SEQ ID No. 783, SEQ ID No. 
5 788, SEQ ID No. 817, SEQ ID No. 848, SEQ ID No. 851, SEQ ID No. 853, SEQ ID No. 857, SEQ ID 
No. 875, SEQ ID No. 877, SEQ ID No. 886, SEQ ID No. 898, SEQ ID No. 902, SEQ ID No. 923, 
SEQ ID No. 938, SEQ ID No. 976, SEQ ID No. 978, SEQ ID No. 990, SEQ ID No. 1005, SEQ ID No. 
1021, SEQ ID No. 1035, SEQ ID No. 1069, SEQ ID No. 1083, SEQ ID No. 1088, SEQ ID No. 1089, 
SEQ ID No. 1091, SEQ ID No. 1092, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1 100, SEQ 
10 ID No. 1 105, SEQ ID No. 1 108, SEQ ID No. 1 1 17, SEQ ID No. 1 120, SEQ ID No. 1 121, SEQ ID No. 
1124, SEQ ID No. 1128, SEQ ID No. 1133, SEQ ID No. 1135, SEQ ID No. 1139, SEQ ID No. 1140, 
SEQ ID No. 1 157, SEQ ID No. 1 159, SEQ ID No. 1 163, SEQ ID No. 1 165, SEQ ID No. 1 167, SEQ 
ID No. 1 168, SEQ ID No. 1 169, SEQ ID No. 1 171, SEQ ID No. 1 173, SEQ ID No. 1 174, SEQ ID No. 
1177, SEQ ID No. 1 180, SEQ ID No. 1181, SEQ ID No. 1186, SEQ ID No. 1194, SEQ ID No. 1197, 
15 and their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide involved in lipopolysaccharide (LPS) biosynthesis, and in 
that it is chosen from the polypeptides having the following sequences: SEQ ID No. 17, SEQ ID No. 
201, SEQ ID No. 691, SEQ ID No. 807, SEQ ID No. 936, SEQ ID No. 983, SEQ ID No. 1019, SEQ 
20 ID No. 1077, and their representative fragments. 

Preferably, the invention relates to additional LPS-related polypeptides according to 
the invention, in that it is: 

(a) a Chlamydia trachomatis KDO (3-deoxy-D-manno-octyIosonic acid)-related 
polypeptide or one of its representative fragments, and in that it is chosen from the polypeptides 

25 having the following sequences: SEQ ID No. 41, SEQ ID No. 242, SEQ ID No. 269, SEQ ID No. 772, 
and one of their representative fragments; 

(b) a Chlamydia trachomatis phosphomannomutase-related polypeptide or one of 
its representative fragments, and in that it is chosen from the polypeptides having the following 
sequence: SEQ ID No. 139, and its representative fragments; 

30 ( c ) a Chlamydia trachomatis phosphoglucomutase-related polypeptide or one of 

its representative fragments, and in that it is chosen from the polypeptides having the following 
sequence: SEQ ID No. 567 and its representative fragments; and 

(d) a Chlamydia trachomatis lipid A component-related polypeptide or one of its 
representative fragments, and in that it is chosen from the polypeptides having the following 

35 sequences: SEQ ID No. 4, SEQ ID No. 933, SEQ ID No. 934, SEQ ID No. 935. SEQ ID No. 1185, 



WO 99/28475 PCT/IB98/01939 

41 

and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide or one of its representative fragments that contains an RGD 
sequence and is also an outer membrane protein, and in that it is chosen from the polypeptides having 
5 the following sequences: SEQ. ID No. 488, SEQ ID No. 489, SEQ ID No. 571, SEQ ID No. 572, SEQ 
No. 573, SEQ ID No. 716 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis polypeptide or one of its representative fragments that is cysteine-rich and 
contains RGD sequence, and in that it is chosen from the polypeptides having the following sequence: 

10 SEQ ID No. 144 and one of its representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis outer membrane polypeptide that contains cysteines in their first 30 amino 
acids and also contain an RGD sequence, and in that it is chosen from the polypeptides having the 
following sequences: SEQ ID No. 101, SEQ ID No. 122, SEQ ID No. 308, SEQ ID No. 488, SEQ ID 

15 No. 489, SEQ ID No. 571, SEQ ID No. 572, SEQ ID No. 573, SEQ ID No. 651, SEQ ID No. 679, 
SEQ ID No. 680, SEQ ID No. 705, SEQ ID No. 716, SEQ ID No. 763, SEQ ID No. 870, SEQ ID No. 
878, SEQ ID No. 879, SEQ ID No. 995, SEQ ID No. 1028, SEQ ID No. 1029, SEQ ID No. 1 176, and 
one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 

20 is a Chlamydia trachomatis polypeptide or one of its representative fragments that contains RGD 
sequences homologous to Chlamydia pneumoniae polypeptides containing RGD sequences, and in 
that it is chosen from the polypeptides having the following sequences: SEQ ID No. 28, SEQ ID No. 
101, SEQ ID No. 125, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 286, SEQ ID No. 571, SEQ ID 
No. 572, SEQ ID No. 573, SEQ ID No. 763, SEQ ID No. 870, and one of their representative 

25 fragments. 

Preferably, the invention relates to a polypeptide according to the invention, in that it 
is a Chlamydia trachomatis Type m or non-Type m secreted polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 
SEQ ID No. 180, SEQ ID No. 181, SEQ ID No. 207, SEQ ID No. 208, SEQ ID No. 372, SEQ ID No. 

30 391, SEQ ID No. 399, SEQ ID No. 477, SEQ ID No. 486, SEQ ID No. 749, SEQ ID No. 758, SEQ ID 
No. 819, SEQ ID No. 878, SEQ ID No. 888, SEQ ID No. 896, SEQ ID No. 897, SEQ ID No. 900, 
SEQ ID No. 902, SEQ ID No. 923, SEQ ID No. 1015, SEQ ID No. 1018, SEQ ED No. 1059, SEQ ID 
No. 1060, SEQ ID No. 1069, SEQ ID No. 1071, SEQ ID No. 1073, SEQ ID No. 1076, SEQ ID No. 
1 1 89, and their representative fragments. 

35 Preferably, the invention relates to a polypeptide according to the invention, in that it 



WO 99/28475 PCT/1B98/01939 

42 

is a Chlamydia trachomatis cell wall anchored surface polypeptide or one of its representative 

fragments, and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 662, SEQ ID No. 681, SEQ ID No. 1182. SEQ ID No. 1192, and their representative 

fragments. 

5 Preferably, the invention relates to a polypeptide according to the invention, in that it 

is a Chlamydia trachomatis polypeptide or one of its representative fragments not found in Chlamydia 
pneumoniae (Blastp P>e" 10 ) and in that it is chosen from the polypeptides having the following 
sequences: SEQ ID No.2, SEQ ID No. 18, SEQ ID No. 60, SEQ ID No. 66, SEQ ID No. 67, SEQ ID 
No.68, SEQ ID No. 69, SEQ ID No. 70, SEQ ID No. 81, SEQ ID No. 89, SEQ ID No. 107, SEQ ID 
10 No.108, SEQ ID No. 109, SEQ ID No.134, SEQ ID No. 147, SEQ ID No.191, SEQ ID No. 194, SEQ 
ID No. 216, SEQ ID No. 217, SEQ ID No. 218, SEQ ID No. 219, SEQ ID No. 220, SEQ ID No. 221, 
SEQ ID No. 222, SEQ ID No. 222, SEQ ID No. 223, SEQ ID No. 224, SEQ ID No. 225, SEQ ID No. 
228, SEQ ID No. 235, SEQ ID No.257, SEQ ID No. 276, SEQ ID No. 277, SEQ ID No. 278, SEQ ID 
No. 279, SEQ ID No. 280, SEQ ID No. 281, SEQ ID No. 282, SEQ ID No.283, SEQ ID No. 284, 
15 SEQ ID No. 285, SEQ ID No. 289, SEQ ID No.291, SEQ ID No. 298, SEQ ID No. 284, SEQ ID No. 
313, SEQ ID No. 314, SEQ ID No. 315, SEQ ID No. 316, SEQ ID No. 334, SEQ ID No. 335, SEQ ID 
No. 336, SEQ ID No. 337, SEQ ID No. 338, SEQ ID No. 339, SEQ ID No. 340, SEQ ID No. 381, 
SEQ ID No. 393, SEQ ID No. 413, SEQ ID No. 418, SEQ ID No. 419, SEQ ID No. 419, SEQ ID No. 
420, SEQ ID No. 421, SEQ ID No. 422, SEQ ID No. 423, SEQ ID No. 436, SEQ ID No. 460, SEQ ID 
20 No. 475, SEQ ID No. 476, SEQ ID No. 480, SEQ ID No. 485, SEQ ID No. 487, SEQ ID No.491, SEQ 
ID No. 492, SEQ ID No. 493, SEQ ID No. 494, SEQ ID No. 496, SEQ ID No. 500, SEQ ID No.504, 
SEQ ID No. 514, SEQ ID No. 527, SEQ ID No. 559, SEQ ID No.569, SEQ ID No. 570, SEQ ID No. 
575, SEQ ID No. 580, SEQ ID No. 582, SEQ ID No. 593, SEQ ID No. 598, SEQ ID No.632, SEQ ID 
No.640, SEQ ID No.651, SEQ ID No.671, SEQ ID No. 690, SEQ ID No. 694, ID No. 698, SEQ ID 
25 No. 710, SEQ ID No. 722. SEQ ID No. 723, SEQ ID No. 724, SEQ ID No. 770..SEQ ID No. 771, 
SEQ ID No.782, SEQ ID No. 783, SEQ ID No. 784, SEQ ID No. 790, SEQ ID No. 795, SEQ ID No. 
798, SEQ ID No. 805, SEQ ID No. 810, SEQ ID No. 817, SEQ ID No. 829, SEQ ID No. 830, SEQ ID 
No. 864, SEQ ID No. 866, SEQ ID No. 876, SEQ ID No. 887, SEQ ID No. 892, SEQ ID No. 899, 
SEQ ID No. 913, SEQ ID No. 921, SEQ ID No. 933, SEQ ID No. 938, SEQ ID No. 949, SEQ ID No. 
30 956, SEQ ID No. 1010, SEQ ID No. 1017, SEQ ID No. 1018, SEQ ID No. 1027, SEQ ID No. 1030, 
SEQ ID No. 1037, SEQ ID No. 1038, SEQ ID No. 1047, SEQ ID No. 1072, SEQ ID No. 1074, SEQ 
ID No. 1075, SEQ ID No. 1078, SEQ ID No. 1079, SEQ ID No. 1081, SEQ ID No. 1083. SEQ ID No. 
1084, SEQ ID No. 1087, SEQ ID No. 1088, SEQ ID No. 1089, SEQ ID No. 1091, SEQ ID No. 1092, 
SEQ ID No. 1094, SEQ ID No. 1095, SEQ ID No. 1096, SEQ ID No. 1098, SEQ ID No. 1 104, SEQ 
35 ID No. 1 105, SEQ ID No. 1 106, SEQ ID No. 1 108, SEQ ID No. 1 1 10, SEQ ID No. 1 1 14, SEQ ID No. 
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1115, SEQ ED No. 1116, SEQIDNo. 1117, SEQ ID No. 1 119. SEQ ID No. 1128, SEQ ID No. 1132, 
SEQ ID No. 1 133, SEQ ID No. 1 135, SEQ ID No. 1 136, SEQ ID No. 1 139, SEQ ID No. 1 140, SEQ 
ID No. 1 141, SEQ ID No. 1 142, SEQ ID No. 1 144, SEQ ID No. 1 148, SEQ ID No. 1 151. SEQ ID No. 
1 155, SEQ ID No. 1 157, SEQ ID No. 1 159, SEQ ID No. 1 161, SEQ ED No. 1 162, SEQ ID No. 1 165, 
SEQ ID No. 1166, SEQ ID No. 1167, SEQ ID No. 1168, SEQ ID No. 1169, SEQ ID No. 1171, SEQ 
ID No. 1 172, SEQ ID No. 1 173, SEQ ID No. 1 174, SEQ ID No. 1 175, SEQ ID No. 1 176, SEQ ID No. 
1177, SEQIDNo. 1178, SEQIDNo. 1180, SEQ ID No. 1181, SEQ ID No. 1183, SEQIDNo. 1184, 
SEQ ID No. 1 186, SEQ ID No. 1 187, SEQ ID No. 1 188, SEQ ID No. 1 192, SEQ ID No. 1194, SEQ 
ID No. 1 197, and their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the intermediate metabolism, in particular in the metabolism of sugars and/or of 
cofactors, and in that it is chosen from the polypeptides having the following sequences: 
SEQIDNo. 10; SEQ ID No. 44; SEQ ID No. 45; SEQ ID No. 46; SEQ ID No. 47; SEQ ID No. 93 



15 SEQIDNo. 101; 

SEQIDNo. 120; 

SEQ ID No. 143; 

SEQIDNo. 160; 

SEQIDNo. 197; 
20 SEQ ID No. 236; 

SEQ ID No. 264; 

SEQ ID No. 359; 

SEQ ID No. 405; 

SEQ ED No. 417; 
25 SEQ ID No. 495; 

SEQ ED No. 566; 

SEQ ED No. 647; 

SEQ ED No. 673; 

SEQ ED No. 700; 
30 SEQ ED No. 836; 

SEQ ED No. 863; 



SEQ ED No. 102 
SEQIDNo. 121 
SEQ ED No. 144 
SEQ ED No. 161 
SEQIDNo. 198 
SEQIDNo. 236: 
SEQ ED No. 265 
SEQIDNo. 360 
SEQ ED No. 410 
SEQ ED No. 432 
SEQ ED No. 513 
SEQIDNo. 589 
SEQ ID No. 652; 
SEQ ID No. 674 
SEQ ED No. 725 
SEQ ED No. 837 
SEQ ED No. 869 



SEQIDNo. 103; 
SEQIDNo. 130; 
SEQIDNo. 145; 
SEQIDNo. 192; 
SEQ ID No. 199; 
SEQ ED No. 239; 
SEQ ID No. 297; 
SEQ ED No. 374; 
SEQ ED No. 415; 
SEQ ED No. 460; 
SEQ ED No. 515; 
SEQ ED No. 613 ; 
SEQ ED No. 653; 
SEQ ID No. 682; 
SEQIDNo. 801; 
SEQIDNo. 860; 



SEQIDNo. 106; 
SEQIDNo. 135; 
SEQIDNo. 158; 
SEQIDNo. 193; 
SEQ ED No. 227; 
SEQ ED No. 243; 
SEQIDNo. 331; 
SEQ ED No. 404; 
SEQIDNo. 415; 
SEQ ID No. 461; 
SEQIDNo. 566; 
SEQ ED No. 645; 
SEQ ED No. 654; 
SEQIDNo. 684; 
SEQ ED No. 802; 
SEQIDNo. 861; 
SEQ ED No. 925; 



SEQIDNo. 107: 
SEQIDNo. 140 
SEQIDNo. 159 
SEQIDNo. 196 
SEQ ED No. 229 
SEQ ID No. 245 
SEQ ED No. 333 
SEQ ID No. 405 
SEQ ED No. 416 
SEQ ED No. 462 
SEQ ED No. 566 
SEQ ID No. 646 
SEQIDNo. 672 
SEQ ID No. 692 
SEQ ID No. 835 
SEQ ED No. 862 
SEQ ED No. 964 



SEQ ED No. 869; 
SEQ ED No. 983 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
35 which is involved in the intermediate metabolism of nucleotides or nucleic acids, and in that it is 
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chosen from the polypeptides having the following sequences: 



SEQIDNo. 142 
SEQ ID No. 352 
SEQIDNo. 529 
SEQ ID No. 602 



SEQIDNo. 169 
SEQIDNo. 366 
SEQIDNo. 530: 
SEQIDNo. 617 



SEQ ID No. 256 
SEQIDNo. 435 
SEQIDNo. 548 
SEQIDNo. 619 



SEQ ID No. 268; 
SEQ ID No. 444; 
SEQ ID No. 549; 
SEQ ID No. 644; 



SEQ ID No. 971; SEQ ID No. 972; SEQ ID No. 1023 and one of their representative 



SEQIDNo. 142 
SEQIDNo. 325 
SEQ ID No. 528 
5 SEQ ID No. 601 
SEQ ID No. 745 
fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
10 which is involved in the metabolism of nucleic acids, and in that it is chosen from the polypeptides 
having the following sequences: 

SEQ ID No. 5; SEQIDNo. 12; SEQ ID No. 82; SEQ ID No. 96; SEQ ID No. 97; SEQ ID No. 98 
SEQ ID No. 99; SEQIDNo. 100; SEQIDNo. 105; SEQIDNo. 118; SEQ ID No. 136 

SEQ ID No. 190; 
SEQIDNo. 290; 
SEQ ID No. 427; 
SEQIDNo. 595; 
SEQ ID No. 605; 
SEQ ID No. 657; 
SEQ ID No. 708; 
SEQIDNo. 779; 
SEQ ID No. 842; 
SEQ ID No. 924; 
SEQ ID No. 963; 

25 SEQ ID No. 975; SEQ ID No. 979; SEQ ID No. 995; SEQ ID No. 1031; SEQ ID No. 1032 and one of 
their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of amino acids or polypeptides, and in that it is chosen from the 

30 polypeptides having the following sequences: 

SEQ ID No. 27; SEQ ID No. 41; SEQ ID No. 55; SEQ ID No. 56; SEQ ID No. 57; SEQ ID No. 59 
SEQ ID No. 62; SEQ ID No. 63; SEQ ID No. 64; SEQ ID No. 65; SEQ ID No. 119; SEQ ID No. 132 



SEQIDNo. 137; 
15 SEQ ID No. 260; 

SEQ ID No. 302; 

SEQ ID No. 444; 

SEQ ID No. 599; 

SEQ ED No. 625; 
20 SEQ ID No. 703; 

SEQIDNo. 767; 

SEQ ID No. 794; 

SEQ ED No. 907; 

SEQ ID No. 962; 



SEQIDNo. 163; 
SEQIDNo. 262; 
SEQ ID No. 387; 
SEQ ID No. 471; 
SEQ ID No. 600; 
SEQ ID No. 650; 
SEQ ID No. 704; 
SEQIDNo. 775; 
SEQIDNo. 841; 
SEQ ID No. 918; 
SEQ ID No. 962; 



SEQ ED No. 204; 
SEQIDNo. 300; 
SEQ ID No. 434; 
SEQ ID No. 596; 
SEQ ID No. 612; 
SEQ ID No. 658; 
SEQ ID No. 719; 
SEQ ID No. 787; 
SEQ ID No. 883; 
SEQ ID No. 928; 
SEQIDNo. 969; 



SEQ ID No. 259 
SEQIDNo. 301 
SEQ ID No. 441 
SEQIDNo. 597 
SEQ ID No. 624 
SEQIDNo. 702 
SEQ ID No. 766 
SEQ ID No. 788 
SEQ ID No. 884 
SEQ ID No. 929 
SEQ ID No. 970 



SEQ ID No. 240 
SEQ ID No. 382 
35 SEQ ID No. 447 



SEQ ID No. 241 
SEQ ID No. 406 
SEQ ID No. 453 



SEQ ID No. 277 
SEQ ID No. 428 
SEQ ID No. 454 



SEQ ID No. 278 
SEQ ID No. 442 
SEQ ID No. 541 



SEQIDNo. 279 
SEQ ID No. 446 
SEQ ID No. 542 
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SEQ ID No. 648 
SEQIDNo. 717 
SEQ ID No. 920 
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SEQ ID No. 608; SEQ ID No. 609; 
SEQ ED No. 649; SEQ ID No. 660; 
SEQ ID No. 765; SEQ ID No. 797 ; 
SEQ ID No. 922; SEQ ID No. 937; 



SEQ ED No. 610; SEQIDNo. 618 

SEQ ID No. 661 ; SEQ ID No. 677 

SEQ ID No. 871; SEQTD No. 875 

SEQ ID No. 998; SEQ ID No. 1020 



5 SEQ ID No. 1021; SEQ ID No. 1034; SEQ ED No. 1044; SEQ ID No. 1046; SEQ ID No. 1049 and 
one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of polypeptides, and in that it is chosen from the polypeptides 
10 having the following sequences: 

SEQ ID No. 21; SEQ ID No. 22; SEQ ID No. 23; SEQ ID No. 24; SEQ ID No. 25; SEQ ID No. 26; 
SEQ ID No. 75; SEQ ED No. 84; SEQ ED No. 86; SEQ ID No. 92; SEQ ED No. 133; SEQ ED No. 151; 



SEQ ID No. 152; 
SEQ ID No. 326; 
15 SEQ ED No. 429; 
SEQ ED No. 630; 
SEQIDNo. 751; 



SEQ ED No. 157; 
SEQ ED No. 343; 
SEQ ED No. 519; 
SEQ ID No. 656; 
SEQ ED No. 752; 



SEQ ED No. 209 
SEQ ED No. 345 
SEQ ED No. 586 
SEQ ED No. 707 
SEQ ED No. 847 



SEQ ED No. 307; 
SEQIDNo. 371; 
SEQIDNo. 587; 
SEQ ID No. 730; 
SEQ ID No. 885; 



20 



SEQ ID No. 179 
SEQ ED No. 344 
SEQ ID No. 557 
SEQ ID No. 706 
SEQ ED No. 786 

SEQ ID No. 923; SEQ ED No. 978; SEQ ID No. 1039; SEQIDNo. 1048 and one of their 
representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the metabolism of fatty acids, and in that it is chosen from the polypeptides 
having the following sequences: 

SEQ ED No. 4; SEQ ED No. 15; SEQ ED No. 16; SEQ ID No. 141; SEQ ED No. 173; SEQ ED No. 205; 



25 SEQ ID No. 205 
SEQ ED No. 355 
SEQ ED No. 561 
SEQ ED No. 590 
SEQ ID No. 904 

30 SEQ ED No. 934 



SEQ ED No. 206 
SEQ ED No. 415 
SEQ ID No. 574 
SEQ ID No. 614 
SEQ ED No. 905 



SEQ ED No. 208 
SEQ ID No. 558 
SEQ ED No. 577 
SEQ ID No. 808 
SEQ ED No. 933 



SEQ ED No. 312; 
SEQIDNo. 560; 
SEQIDNo. 578; 
SEQ ID No. 809; 
SEQ ED No. 934; 



SEQ ED No. 207 
SEQ ED No. 550 
SEQ ED No. 574 
SEQ ED No. 772 
SEQ ED No. 905 
; SEQ ED No. 936 and one of their representative fragments. 
Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the synthesis of the wall, and in that it is chosen from the polypeptides having the 
following sequences: 

35 SEQ ED No. 87; SEQIDNo. 196; SEQ ED No. 242; SEQ ED No. 269; SEQ ED No. 628; 
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SEQ ID No. 364 
SEQ ID No. 744 
SEQ ID No. 982; 



SEQ ID No. 213 
SEQ ID No. 407 
SEQ ID No. 746 
SEQ ID No. 1003 
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SEQ ID No. 629; SEQ ID No. 634; SEQ ID No. 635; SEQ ID No. 637; SEQ ID No. 638; 
SEQ ID No. 1019 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
5 which is involved in the transcription, translation and/or maturation process, and in that it is chosen 
from the polypeptides having the following sequences: 
SEQ ID No. 112; SEQ ID No. 1 13; SEQ ID No. 332; 
SEQ ID No. 350; SEQ ID No. 362; SEQ ID No. 363; 
SEQ ID No. 451; SEQ ID No. 546; SEQ ID No. 643; 
10 SEQ ID No. 833; SEQ ID No. 868; SEQ ID No. 981; 

SEQ ID No. 101 1; SEQ ID No. 1042 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis ribosomal polypeptide or one of its representative 
fragments, and in that it is chosen from the polypeptides having the following sequences: 
15 SEQ ID No. 114; SEQ ID No. 115; SEQ ID No. 116; SEQ ID No. 328; SEQ ID No. 361 

SEQ ID No. 543; 
SEQ ID No. 941; 
SEQ ID No. 948; 
SEQ ID No. 954; 
SEQ ID No. 960; 

SEQ ID No. 1041; SEQ ID No. 1043; SEQ ID No. 1063; SEQ ID No. 1064 and one of their 
fragments. 

Preferably, the invention also relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis transport polypeptide or one of its representative 
25 fragments, and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 6; SEQ ID No. 50; SEQ ID No. 51; SEQ ID No. 80; SEQ ID No. 125; SEQ ID No. 126 
SEQ ID No. 128; SEQ ID No. 129; SEQ ID No. 215; SEQ ID No. 246; SEQ ID No. 248 

SEQ ID No. 252; 
SEQ ID No. 293; 
SEQ ID No. 401; 
SEQ ID No. 565; 
SEQ ID No. 712; 
SEQ ID No. 731; 
SEQ ID No. 908; 
SEQ ID No. 992; 



SEQ ED No. 375; 
SEQ ID No. 743; 
SEQ ID No. 946; 
SEQ ID No. 952; 
20 SEQ ID No. 957; 



SEQ ID No. 445; 
SEQ ED No. 813; 
SEQ ID No. 947; 
SEQ ID No. 953; 
SEQ ID No. 958; 



SEQ ID No. 584; 
SEQ ID No. 942; 
SEQ ID No. 950; 
SEQ ID No. 955; 
SEQ ID No. 961; 



SEQ ID No. 585 
SEQ ID No. 944 
SEQ ID No. 951 
SEQ ID No. 955 
SEQ ID No. 1040 



SEQ ID No. 249; 

SEQ ID No. 271; 
30 SEQ ED No. 324; 

SEQ ID No. 512; 

SEQ ID No. 680; 

SEQ ED No. 715; 

SEQ ID No. 738; 
35 SEQ ID No. 987; 



SEQ ED No. 251; 
SEQ ED No. 275; 
SEQ ED No. 398; 
SEQ ED No. 564; 
SEQ ID No. 711; 
SEQ ID No. 730; 
SEQ ID No. 870; 
SEQ ID No. 988; 



SEQ ED No. 253; 
SEQ ED No. 309; 
SEQ ED No. 449; 
SEQ ED No. 667; 
SEQ ID No. 713; 
SEQ ID No. 736; 
SEQ ID No. 919; 
SEQ ED No. 993; 



SEQ ED No. 255 
SEQ ID No. 323 
SEQ ID No. 511 
SEQ ED No. 679 
SEQ ED No. 714 
SEQ ED No. 737 
SEQ ED No. 977 
SEQ ED No. 994 
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SEQ ID No. 1028; SEQ ID No. 1029 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
which is involved in the virulence process, and in that it is chosen from the polypeptides having the 
5 following sequences: 

SEQ ID No. 20; SEQ ID No. 815; SEQ ID No. 816; SEQ ID No. 898; SEQ ID No. 1059; 
SEQ ID No. 1060 and one of their representative fragments. 

Preferably, the invention relates to a polypeptide according to the invention, 
characterized in that it is a Chlamydia trachomatis polypeptide or one of its representative fragments 
10 which is involved in the secretory system and/or which is secreted, and in that it is chosen from the 
polypeptides having the following sequences: 

SEQ ID No. 758; SEQ ID No. 888; SEQ ID No. 889; SEQ ID No. 890; SEQ ID No. 891; 
SEQ ID No. 896; SEQ ID No. 897; SEQ ID No. 898 and one of their representative fragments. 

The secreted polypeptides, including the Type III and other, non-Type m secreted 

15 polypeptides, of the present invention, as well as the corresponding nucleotide sequences, may be 
detected by techniques known to persons skilled in the art, such as for example the techniques using 
cloning combined with vectors allowing the expression of the said polypeptides fused to export 
markers such as the luc gene for luciferase or the PhoA gene for alkaline phosphatase. 

Preferably, the invention relates to a polypeptide according to the invention, 

20 characterized in that it is a polypeptide specific to Chlamydiae or one of its representative fragments, 
and in that it is chosen from the polypeptides having the following sequences: 

SEQ ID No. 22; SEQ ID No. 29; SEQ ID No. 31; SEQ ID No. 32; SEQ ID No. 34; SEQ ID No. 35; 
SEQ ID No. 39; SEQ ID No. 40; SEQ ID No. 43; SEQ ID No. 48; SEQ ID No. 49; SEQ ID No. 50; 
SEQ ID No. 52; SEQ ID No. 53; SEQ ID No. 54; SEQ ID No. 72; SEQ ID No. 77; SEQ ID No. 78; 
25 SEQ ID No. 87; SEQ ID No. 90; SEQ ID No. 95; SEQ ID No. 108; SEQ ID No. 110; 



SEQ ID No. Ill 
SEQ ID No. 138 
SEQ ID No. 164 
SEQ ID No. 186 

30 SEQ ED No. 247 
SEQ ID No. 273 
SEQ ID No. 306 
SEQ ID No. 326 
SEQ ID No. 380 

35 SEQ ID No. 433 



SEQ ID No. 122; 
SEQ ID No. 144; 
SEQ ID No. 166; 
SEQ ID No. 187; 
SEQ ID No. 258; 
SEQ ID No. 274; 
SEQ ID No. 309; 
SEQ ID No. 342; 
SEQ ID No. 388; 
SEQ ID No. 438; 



SEQ ID No. 123 
SEQ ID No. 146 
SEQ ID No. 175 
SEQ ID No. 188 
SEQ ID No. 266 
SEQ ID No. 295 
SEQ ID No. 3 18 
SEQ ID No. 357 
SEQ ID No. 390 
SEQ ID No. 443 



SEQ ID No. 124; 
SEQ ID No. 153; 
SEQ ID No. 182; 
SEQ ID No. 202; 
SEQ ID No. 267; 
SEQ ID No. 296; 
SEQ ID No. 319; 
SEQ ID No. 376; 
SEQ ID No. 400; 
SEQ ID No. 456; 



SEQ ID No. 127; 
SEQ ID No. 155; 
SEQ ID No. 184; 
SEQ ID No. 210; 
SEQ ID No. 270; 
SEQ ID No. 305; 
SEQ ID No. 322; 
SEQ ID No. 379; 
SEQ ID No. 431; 
SEQ ID No. 457; 
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SEQIDNo.458; SEQIDNo.464; SEQIDNo.468; SEQIDNo.470; SEQIDNo.473 
SEQroNo.486; SEQIDNo.489; SEQIDNo.497; SEQIDNo.501; SEQIDNo.503 
SEQIDNo.504; SEQIDNo.508; SEQ ID No. 512; SEQIDNo.521; SEQTDNo.522 
SEQIDNo.523; SEQIDNo.524; SEQIDNo.533; SEQIDNo.535; SEQ ID No. 536 
5 SEQIDNo.537; SEQIDNo.538; SEQIDNo.539; SEQIDNo.540; SEQIDNo.554 
SEQ ID No. 563; SEQIDNo.572; SEQIDNo.579; SEQIDNo.595; SEQIDNo.603 
SEQIDNo.604; SEQIDNo.606; SEQIDNo.607; SEQ ID No. 615; SEQIDNo.616 
SEQ ID No. 622; SEQIDNo.641; SEQ ID No. 642; SEQIDNo.659; SEQ ID No. 668 
SEQIDNo.670; SEQIDNo.693; SEQ ID No. 695; SEQIDNo.696; SEQ ID No. 699 
10 SEQIDNo.703; SEQIDNo.704; SEQIDNo.716; SEQE)No.726; SEQIDNo.728 
SEQIDNo.739; SEQIDNo.742; SEQIDNo.747; SEQIDNo.750; SEQIDNo.751 
SEQIDNo.755; SEQroNo.757; SEQIDNo.759; SEQIDNo.761; SEQ ID No. 762 
SEQIDNo.763; SEQIDNo.764; SEQ ID No. 773; SEQIDNo.780; SEQ ID No. 781 
SEQIDNo.789; SEQ ID No. 800; SEQ ID No. 803; SEQ ID No. 804; SEQ ID No. 818 
15 SEQ ED No. 820; SEQ ID No. 822; SEQ ID No. 823; SEQ ID No. 824; SEQ ID No. 827 
SEQ ID No. 828; SEQ ID No. 839; SEQ ID No. 849; SEQ ID No. 850; SEQ ID No. 851 
SEQ ID No. 852; SEQ ID No. 855; SEQ ID No. 856; SEQ ID No. 857; SEQ ID No. 858 
SEQ ID No. 859; SEQ ID No. 860; SEQ ID No. 861; SEQ ID No. 862; SEQ ID No. 863 
SEQ ID No. 865; SEQ ID No. 868; SEQ ID No. 869; SEQ ID No. 870; SEQ ID No. 871 
20 SEQ ID No. 872; SEQ ID No. 873; SEQ ID No. 874; SEQ ID No. 875; SEQ ID No. 877 
SEQ ID No. 878; SEQ ID No. 880; SEQ ID No. 882; SEQ ID No. 884; SEQ ID No. 886 
SEQ ID No. 893; SEQ ID No. 901; SEQ ID No. 906; SEQ ID No. 910; SEQ ID No. 912 
SEQ ID No. 915; SEQ ID No. 916; SEQ ID No. 917; SEQ ID No. 926; SEQ ID No. 929 
SEQ ID No. 933; SEQ ID No. 965; SEQ ID No. 967; SEQ ID No. 968; SEQ ID No. 984 
25 SEQ ID No. 986; SEQ ID No. 989; SEQ ID No. 990; SEQ ID No. 996; SEQ ID No. 997 
SEQ ID No. 1001; SEQ ID No. 1002; SEQ ID No. 1013; SEQ ID No. 1016; SEQ ID No. 1031 
SEQ ID No. 1033; SEQ ID No. 1035; SEQ ID No. 1049; SEQ ID No. 1051; SEQ ID No. 1052 
SEQ ID No. 1054; SEQ ID No. 1056; SEQ ID No. 1057; SEQ ID No. 1058; SEQ ID No. 1062 
SEQ ID No. 1070; SEQ ID No. 1071; SEQ ID No. 1073 and one of their representative fragments. 
30 In general, in the present invention, the functional group to which a polypeptide of the 

invention belongs, as well as its corresponding nucleotide sequence, may be determined either by 
comparative analogy with sequences already known, or by the use of standard techniques of 
biochemistry, of cytology combined with the techniques of genetic engineering such as 
immunoaffinity, localization by immunolabelling, differential extraction, measurement of enzymatic 
35 activity, study of the activity inducing or repressing expression or the study of expression in E. coli. 



WO 99/28475 PCT/IB98/01939 

49 

It is clearly understood, on the one hand, that, in the present invention, the nucleotide 
sequences (ORF) and the amino acid sequences (SEQ ID No. 2 to SEQ ID No. 1 197 ) which are listed 
by functional group, are not exhaustive within the group considered. Moreover, it is'also clearly 
understood that, in the present invention, a nucleotide sequence (ORF) or an amino acid sequence 
5 mentioned within a given functional group may also be part of another group taking into account, for 
example, the interrelationship between the groups listed. Accordingly, and as an example of this 
interrelationship, an exported and/or secreted polypeptide as well as its coding nucleotide sequence 
may also be involved in the Chlamydia trachomatis virulence process by modifying the defense 
mechanism of the infected host cell, or a transmembrane polypeptide or its coding nucleotide 
1 0 sequence is also part of the polypeptides or coding nucleotide sequences of the cellular envelope. 

The subject of the present invention is also the nucleotide and/or polypeptide 
sequences according to the invention, characterized in that the said sequences are recorded on a 
medium, called recording medium, whose type and nature facilitate the reading, the analysis and the 
exploitation of the said sequences. These media may of course also contain other information 
15 extracted from the present invention, such as in particular the analogies with already known 
sequences, such as those mentioned in Table 1 of the present description, and/or may contain, in 
addition, information relating to the nucleotide and/or polypeptide sequences of other microorganisms 
so as to facilitate the comparative analysis and the exploitation of the results obtained. 

Among these recording media, computer-readable media, such as magnetic, optical, 
20 electrical and hybrid media such as, for example, floppy disks, CD-ROMs or recording cassettes, are 
preferred in particular. 

The invention also relates to nucleotide sequences which can be used as primer or 
probe, characterized in that the said sequences are chosen from the nucleotide sequences according to 
the invention. 

25 The invention relates, in addition, to the use of a nucleotide sequence according to the 

invention, as primer or probe, for the detection and/or amplification of nucleic acid sequences. 

The nucleotide sequences according to the invention may thus be used to amplify 
nucleotide sequences, in particular by the PCR technique (polymerase chain reaction) (Erlich, 1989; 
Innis et al., 1990; Rolfs et al., 1991, and White et al., 1997). 

30 These oligodeoxyribonucleotide or oligoribonucleotide primers correspond to 

representative nucleotide fragments, and are advantageously at least 8 nucleotides, preferably at least 
12 nucleotides, 15 nucleotides and still more preferably at least 20 nucleotides long. 

Other techniques for amplifying the target nucleic acid may be advantageously used 
as alternatives to PCR. 

35 Th ^ nucleotide sequences of the invention, in particular the primers according to the 
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invention, may also be used in other methods for amplifying a target nucleic acid, such as: 

the TAS (Transcription-based Amplification System) technique described by Kwoh et al. in 

1989; 

the 3SR (Self-Sustained Sequence Replication) technique described by Guatelli et al. in 1990; 
5 - the NASBA (Nucleic Acid Sequence Based Amplification) technique described by Kievitis 
etal. in 1991; 

the SDA (Strand Displacement Amplification) technique (Walker et al., 1992); 
the TMA (Transcription Mediated Amplification) technique. 

The polynucleotides of the invention may also be used in techniques for amplifying 
10 or for modifying the nucleic acid serving as probe, such as: 

the LCR (Ligase Chain Reaction) technique described by Landegren etal. in 1988 and 
perfected by Barany et al. in 1991, which uses a thermostable ligase; 

the RCR (Repair Chain Reaction) technique described by Segev in 1992; 
the CPR (Cycling Probe Reaction) technique described by Duck et al. in 1990; 
15 - the Q-beta-replicase amplification technique described by Miele et al. in 1983 and perfected 
in particular by Chu et al. in 1986, Lizardi et al. in 1988, and then by Burg et al. as well as by Stone 
et al. in 1996. 

The invention also relates to the nucleotide sequences of fragments which can be 
obtained by amplification with the aid of at least one primer according to the invention. The present 

20 invention encompasses both hybridization probes and primers. In general, the complementary probes 
should be of the length sufficient to form a stable hybrid complex with the target sequences. Primers, 
while complementary to the target sequences need not form stable hybridization complexes with the 
target sequences alone. Rather, primers form stable complexes with the target sequences in the 
presence of polymerase to permit extension of the primer. 

25 the case where the target polynucleotide to be detected is possibly an RNA, for 

example an mRNA, it will be possible to use, prior to the use of an amplification reaction with the aid 
of at least one primer according to the invention or to the use of a method of detection with the aid of 
at least one probe of the invention, a reverse transcriptase-type enzyme so as to obtain a cDNA from 
the RNA contained in the biological sample. The cDNA obtained will then serve as target for the 

30 primer(s) or the probe(s) used in the amplification or detection method according to the invention. 

The detection probe will be chosen so that it hybridizes with the target sequence or 
the amplicon generated from the target sequence. Such a detection probe will advantageously have as 
sequence a sequence of at least 12 nucleotides, 15 nucleotides, in particular of at least 20 nucleotides, 
and preferably at least 100 nucleotides. 

35 The invention also comprises the nucleotide sequences which can be used as probe or 
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primer according to the invention, characterized in that they are labelled with a radioactive compound 
or with a nonradioactive compound. 

The nonlabelled nucleotide sequences may be used directly as probes or primers; 
however, the sequences are generally labelled with a radioactive element ( 32 P, 35 S, 3 H, 125 I) or with a 
5 nonradioactive molecule (biotin, acetylaminofluorene, digoxigenin, 5-bromo-deoxyuridine, 
fluorescein) so as to obtain probes which can be used in numerous applications. 

Examples of nonradioactive labelling of nucleotide sequences are described, for 
example, in French patent No. 78,10975 or by Urdea et al. or by Sanchez-Pescador et al. in 1988. 

In the latter case, one of the labelling methods described in patents FR-2 422 956 and 
10 FR-2 518 755 may also be used. 

The invention also relates to the nucleotide sequences of fragments which can be 
obtained by hybridization with the aid of at least one probe according to the invention. 

The hybridization technique may be performed in various ways (Matthews et al M 
1988). The most common method consists in immobilizing the nucleic acid extracted from C. 
15 trachomatis cells on a support (such as nitrocellulose, nylon, polystyrene) and in incubating, under 
well-defined conditions, the target nucleic acid immobilized with the probe. After hybridization, the 
excess probe is removed and the hybrid molecules formed are detected by the appropriate method 
(measurement of the radioactivity, of the fluorescence or of the enzymatic activity linked to the 
probe). 

20 The invention also comprises the nucleotide sequences according to the invention, 

characterized in that they are covalently or noncovalently immobilized on a support. 

According to another advantageous embodiment of the nucleic sequences according 
to the invention, the latter may be used immobilized on a support and may thus serve to capture, 
through specific hybridization, the target nucleic acid obtained from the biological sample to be 

25 tested. If necessary, the solid support is separated from the sample and the hybridization complex 
formed between the so-called capture probe and the target nucleic acid is then detected by means of a 
second probe, called detection probe, labelled with an easily detectable element. 

The nucleotide sequences according to the invention may also be used in new 
analytical systems, DNA chips, which allow sequencing, the study of mutations and of the expression 

30 of genes, and which are currently of interest given their very small size and their high capacity in 
terms of number of analyses. 

The principle of the operation of these chips is based on molecular probes, most often 
oligonucleotides, which are attached onto a miniaturized surface, generally of the order of a few 
square centimetres. During an analysis, a sample containing fragments of a target nucleic acid to be 

35 analysed, for example DNA or RNA labelled, for example, after amplification, is deposited onto the 
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DNA chip in which the support has been coated beforehand with probes. Bringing the labelled target 
sequences into contact with the probes leads to the formation, through hybridization, of a duplex 
according to the rule of pairing defined by J.D. Watson and F. Crick. After a washing step, analysis of 
the surface of the chip allows the effective hybridizations to be located by means of the signals 
5 emitted by the labels tagging the target. A hybridization fingerprint results from this analysis which, 
by appropriate computer processing, will make it possible to determine information such as the 
presence of specific fragments in the sample, the determination of sequences and the presence of 
mutations. 

The chip consists of a multitude of molecular probes, precisely organized or arrayed 
10 on a solid support whose surface is miniaturized. It is at the centre of a system where other elements 
(imaging system, microcomputer) allow the acquisition and interpretation of a hybridization 
fingerprint. 

The hybridization supports are provided in the form of flat or porous surfaces 
(pierced with wells) composed of various materials. The choice of a support is determined by its 

15 physicochemical properties, or more precisely, by the relationship between the latter and the 
conditions under which the support will be placed during the synthesis or the attachment of the probes 
or during the use of the chip. It is therefore necessary, before considering the use of a particular 
support (R.S. Matson et al., 1994), to consider characteristics such as its stability to pH, its physical 
strength, its reactivity and its chemical stability as well as its capacity to nonspecifically bind nucleic 

20 acids. Materials such as glass, silicon and polymers are commonly used. Their surface is, in a first 
step, called «functionalization», made reactive towards the groups which it is desired to attach 
thereon. After the fiinctionalization, so-called spacer molecules are grafted onto the activated surface. 
Used as intermediates between the surface and the probe, these molecules of variable size render 
unimportant the surface properties of the supports, which often prove to be problematic for the 

25 synthesis or the attachment of the probes and for the hybridization. 

Among the hybridization supports, there may be mentioned glass which is used, for 
example, in the method of in situ synthesis of oligonucleotides by photochemical addressing 
developed by the company Affymetrix (E.L. Sheldon, 1993), the glass surface being activated by 
silane. Genosensor Consortium (P. M6rel, 1994) also uses glass slides canying wells 3 mm apart, this 

30 support being activated with epoxysilane. 

Polymers or silicon may also be mentioned among these hybridization supports. For 
example, the Andrein Mirzabekov team has developed a chip consisting of polyacrylamide squares 
polymerized on a silanized glass surface (G. Yershov etal., 1996). Several teams use silicon, in 
particular the IFOS laboratory of Ecole Centrale of Lyon which uses a silicon semiconductor substrate 

35 which is p-doped by introducing it into its crystalline structure atoms whose valency is different from 



WO 99/28475 PCT/IB98/01939 

53 

that of silicon. Various types of metals, in particular gold and platinum, may also be used as support 

(Genosensor Consortium (K. Beattie et ah, 1993)). 

The probes according to the invention may be synthesized directly in'situ on the 

supports of the DNA chips. This in situ synthesis may be carried out by photochemical addressing 
5 (developed by the company Affymax (Amsterdam, Holland) and exploited industrially by its 

subsidiary Affymetrix (United States)) or based on the VLSIPS (very large scale immobilized 

polymer synthesis) technology (S.RA. Fodor etal., 1991) which is based on a method of 

photochemically directed combinatory synthesis and the principle of which combines solid-phase 

chemistry, the use of photolabile protecting groups and photolithography. 
10 The probes according to the invention may be attached to the DNA chips in various 

ways such as electrochemical addressing, automated addressing or the use of probe printers 

(T. Livache et al., 1994; G. Yershov et al., 1996; J. Derisi et al. f 1996, and S. Borman, 1996). 

The revealing of the hybridization between the probes of the invention, deposited or 

synthesized in situ on the supports of the DNA chips, and the sample to be analysed, may be 
15 determined, for example, by measurement of fluorescent signals, by radioactive counting or by 

electronic detection. 

The use of fluorescent molecules such as fluorescein constitutes the most common 
method of labelling the samples. It allows direct or indirect revealing of the hybridization and allows 
the use of various fluorochromes. 

20 Affymetrix currently provides an apparatus or a scanner designed to read its Gene 

Chip? chips. It makes it possible to detect the hybridizations by scanning the surface of the chip in 
confocal microscopy (R J. Lipshutz et al., 1995). Other methods of detecting fluorescent signals have 
been tested: coupling of an epifluorescence microscope and a CCD camera (G. Yershov et al., 1996), 
the use of an optica] fibre collecting system (E.L. Sheldon, 1993). A conventional method consists in 

25 carrying out an end labelling, with phosphorus 32, of the target sequences, by means of an appropriate 
apparatus, the Phosphorimager (marketed by Molecular Dynamics). The electronic detection is based 
on the principle that the hybridization of two nucleic acid molecules is accompanied by physical 
phenomena which can be quantified under certain conditions (system developed by Ecole Centrale of 
Lyon and called GEN-FET (GEN field effect transistor)). Genosensor Consortium and the company 

30 Beckman Instruments who are developing an electronic chip or Permittivity Chips? may also be 
mentioned (K. Beattie et al., 1993). 

The nucleotide sequences according to the invention may thus be used in DNA chips 
to carry out the analysis of mutations. This analysis is based on the production of chips capable of 
analysing each base of a nucleotide sequence according to the invention. It is possible, in particular to 

35 this end, to use the microsequencing techniques on a DNA chip. The mutations are detected by 
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extending immobilized primers which hybridize to the template of sequences analysed, just at the 
position adjacent to that of the mutated nucleotide to be detected. A single-stranded template, RNA or 
DNA, of the sequences to be analysed will be advantageously prepared according to conventional 
methods, from products amplified according to PCR-type techniques. The templates of single- 
5 stranded DNA, or of RNA thus obtained are then deposited on the DNA chip, under conditions 
allowing their specific hybridization to the immobilized primers. A thermostable polymerase, for 
example Tth or T7 DNA polymerase, specifically extends the 3 ' end of the immobilized primer with 
a labelled nucleotide analogue complementary to the nucleotide at the position of the variable site. 
For example a thermal cycling is performed in the presence of fluorescent dideoxyribonucleotides. 
10 The experimental conditions will be adapted in particular to the chips used, to the immobilized 
primers, to the polymerases used and to the labelling system chosen. One advantage of 
microsequencing, compared with techniques based on the hybridization of probes, is that it makes it 
possible to identify all the variable nucleotides with optimal discrimination under homogeneous 
reaction conditions; used on DNA chips, it allows optimal resolution and specificity for the routine 

15 and industrial detection of mutations in multiplex. 

The nucleotide sequences according to the invention may also be used in DNA chips 
to carry out the analysis of the expression of the Chlamydia trachomatis genes. This analysis of the 
expression of Chlamydia trachomatis genes is based on the use of chips where probes of the 
invention, chosen for their specificity to characterize a given gene, are present (DJ. Lockhart et al., 

20 1996; D.D. Shoemaker et al., 1996). For the methods of analysis of gene expression using the DNA 
chips, reference may, for example, be made to the methods described by D.J. Lockhart etal. (1996) 
and Sosnowsky etal. (1997) for the synthesis of probes in situ or for the addressing and the 
attachment of previously synthesized probes. The target sequences to be analysed are labelled and in 
general fragmented into sequences of about 50 to 100 nucleotides before being hybridized onto the 

25 chip. After washing as described, for example, by DJ. Lockhart etal. (1996) and application of 
different electric fields (Sosnowsky et al., 1997), the labelled compounds are detected and quantified, 
the hybridizations being carried out at least in duplicate. Comparative analyses of the signal 
intensities obtained with respect to the same probe for different samples and/or for different probes 
with the same sample, determine the differential expression of RNA or of DNA derived from the 

30 sample. 

The nucleotide sequences according to the invention may, in addition, be used in 
DNA chips where other nucleotide probes specific for other microorganisms are also present, and may 
allow the carrying out of a serial test allowing rapid identification of the presence of a microorganism 
in a sample. 

35 Accordingly, the subject of the invention is also the nucleotide sequences according 
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to the invention, characterized in that they are immobilized on a support of a DNA chip. 

The DNA chips, characterized in that they contain at least one nucleotide sequence 
according to the invention, immobilized on the support of the said chip, also form* part of the 
invention. 

5 The said chips will preferably contain several probes or nucleotide sequences of the 

invention of different length and/or corresponding to different genes so as to identify, with greater 
certainty, the specificity of the target sequences or the desired mutation in the sample to be analysed. 

Accordingly, the analyses carried out by means of primers and/or probes according to 
the invention, immobilized on supports such as DNA chips, will make it possible, for example, to 
10 identify, in samples, mutations linked to variations such as intraspecies variations. These variations 
may be correlated or associated with pathologies specific to the variant identified and will make it 
possible to select the appropriate treatment. 

The invention thus comprises a DNA chip according to the invention, characterized in 
that it contains, in addition, at least one nucleotide sequence of a microorganism different from 
15 Chlamydia trachomatis, immobilized on the support of the said chip; preferably, the different 
microorganism will be chosen from an associated microorganism, a bacterium of the Chlamydia 
family, and a variant of the species Chlamydia trachomatis. 

Another subject of the present invention is a vector for the cloning and/or the 
expression of a sequence, characterized in that it contains a nucleotide sequence according to the 
20 invention. 

Among the said vectors according to the invention, the vectors containing a 
nucleotide sequence encoding a polypeptide of the cellular, preferably outer, envelope of Chlamydia 
trachomatis or one of its representative fragments, are preferred. 

In a specific embodiment, the vectors contain a nucleotide sequence encoding a 

25 Chlamydia trachomatis secreted polypeptide or one of its representative fragments or encoding a 
transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of its representative 
fragments, a polypeptide involved in lipopolysaccharide (LPS) biosynthesis, a Type III or non-Type 
HI secreted polypeptide, a polypeptide containing RGD attachment sites, a cell wall anchored surface 
polypeptide, a polypeptide not found in Chlamydia pneumoniae, a ribosomal polypeptide or a 

30 polypeptide involved in secretion, transcription, translation, maturation of proteins, a polypeptide 
involved in the synthesis of the wall, a polypeptide involved in the virulence, a polypeptide involved 
in the intermediate metabolism, in particular in the metabolism of sugars and/or of cofactors, a 
polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids or of fatty 
acids of Chlamydia trachomatis or one of their representative fragments, or a polypeptide specific to 

35 Chlamydiae, are also preferred. 
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According to the invention, the vectors comprise the elements necessary to allow the 
expression and/or the secretion of the said nucleotide sequences in a given host cell, and also form 
part of the invention. 

The vector should, in this case, comprise a promoter, signals for initiation and for 
5 termination of translation, as well as appropriate regions for regulation of transcription. It should be 
capable of being stably maintained in the host cell and may optionally possess particular signals 
specifying the secretion of the translated protein. These different elements are chosen according to the 
host cell used. To this effect, the nucleotide sequences according to the invention may be inserted into 
autonomously-replicating vectors within the chosen host, or integrative vectors in the chosen host. 
10 Any of the standard methods known to those skilled in the art for the insertion of 

DNA fragments into a vector may be used to construct expression vectors containing a chimeric gene 
consisting of appropriate transcriptional/translational control signals and the protein coding 
sequences. These methods may include in vitro recombinant DNA and synthetic techniques and in 
vivo recombinants (genetic recombination). 
15 Expression of a polypeptide, peptide or derivative, or analogs thereof encoded by a 

polynucleotide sequence in SEQ ID No. 1 or ORFs contained within SEQ ID No. 1 may be regulated 
by a second nucleic acid sequence so that the protein or peptide is expressed in a host transformed 
with the recombinant DNA molecule. For example, expression of a protein or peptide may be 
controlled by any promoter/enhancer element known in the art. Promoters which may be used to 
20 control expression include, but are not limited to, the CMV promoter, the SV40 early promoter region 
(Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3* long terminal 
repeat of Rous sarcoma virus (Yamamoto, et aL, 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et al. t 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences 
of the metallothionein gene (Brinster et aL, 1982, Nature 226:39-42); prokaryotic expression vectors 
25 such as the p-lactamase promoter (Villa-Kamaroff, et aL, 1978, Proc. Natl. Acad. Sci. U.S.A. 
75:3727-3731), or the tac promoter (DeBoer, et aL, 1983, Proc. Natl. Acad. Sci. U.S.A. 80:21-25); see 
also "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242:74-94; plant 
expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et aL, 1983, 
Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner, et aL, 1981, Nucl. 
30 Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase 
(Herrera-Estrella et aL, 1984, Nature 310: 1 15-120); promoter elements from yeast or other fungi such 
as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter, and the following animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: elastase I gene control 
35 region which is active in pancreatic acinar cells (Swift et aL, 1984, Cell 38:639-646; Ornitz et aL, 
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1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425- 
515); insulin gene control region which is active in pancreatic beta cells (Hanahan, 1985, Nature 
315:115-122), immunoglobulin gene control region which is active in lymphoid cells (Grosschedl et 
a/., 1984, Cell 38:647-658; Adames et al % 1985, Nature 318:533-538; Alexander et aL, 1987, Mol. 
5 Cell. Biol. 7:1436-1444), mouse mammary tumor virus control region which is active in testicular, 
breast, lymphoid and mast cells (Leder et al> 1986, Cell 45:485-495), albumin gene control region 
which is active in liver (Pinkert et ai y 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et ai, 1985, Mol, Cell. Biol. 5:1639-1648; Hammer 
et al % 1987, Science 235:53-58; alpha 1 -antitrypsin gene control region which is active in the liver 

10 (Kelsey et ai, 1987, Genes and Devel. 1:161-171), beta-globin gene control region which is active in 
myeloid cells (Mogram et ai f 1985, Nature 315:338-340; Kollias et al y 1986, Cell 46:89-94; myelin 
basic protein gene control region which is active in oligodendrocyte cells in the brain (Readhead et 
a/., 1987, Cell 48:703-712); myosin light chain-2 gene control region which is active in skeletal 
muscle (Sani, 1985, Nature 314:283-286), and gonadotropic releasing hormone gene control region 

15 which is active in the hypothalamus (Mason et al % 1986, Science 234:1372-1378). 

The vectors according to the invention are, for example, vectors of plasmid or viral 
origin. In a specific embodiment, a vector is used that comprises a promoter operably linked to a 
protein or peptide-encoding nucleic acid sequence in SEQ ID No. 1, or ORFs contained within SEQ 
ID No. 1, one or more origins of replication, and, optionally, one or more selectable markers (e.g., an 

20 antibiotic resistance gene). Expression vectors comprise regulatory sequences that control gene 
expression, including gene expression in a desired host cell. Preferred vectors for the expression of 
the polypeptides of the invention include the pET-type plasmid vectors (Promega) or pBAD plasmid 
vectors (Invitrogen). Furthermore, the vectors according to the invention are useful for transforming 
host cells so as to clone or express the nucleotide sequences of the invention. 

25 Expression can also be achieved using targeted homologous recombination to activate 

Chlamydia trachomatis genes present in the cloned genomic DNA. A heterologous regulatory 
element may be inserted into a stable cell line or cloned microorganism, such that it is operatively 
linked with an endogenous Chlamydia trachomatis gene present in the cloned genome, using 
techniques, such as targeted homologous recombination, which are well known to those of skill in the 

30 art (See, e^, Chappel, U.S. Patent No. 4,215,051 and Skoultchi, WO 91/06667 each of which is 
incorporated herein in its entirety). 

Expression vector/host cell systems containing inserts of polynucleotide sequences in 
SEQ ID No. 1 or ORFs within SEQ ID No. 1, which encode polypeptides, peptides or derivatives, or 
analogs thereof, can be identified by three general approaches: (a) nucleic acid hybridization, (b) 

35 presence or absence of "marker" gene functions, and (c) expression of inserted sequences. In the first 
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approach, the presence of a polynucleotide sequence inserted in an expression vector can be detected 
by nucleic acid hybridization using probes comprising sequences that are homologous to an inserted 
polynucleotide sequence. In the second approach, the recombinant vector/host system can be 
identified and selected based upon the presence or absence of certain "marker" gene functions (e.g., 
5 thymidine kinase activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of a polynucleotide sequence in the vector. For 
example, if the polynucleotide sequence in SEQ ID No. 1 or ORFs within SEQ ID No. 1 is inserted 
within the marker gene sequence of the vector, recombinants containing the insert can be identified by 
the absence of the marker gene function. In the third approach, recombinant expression vectors can 
10 be identified by assaying the product of the polynucleotide sequence expressed by the recombinant. 
Such assays can be based, for example, on the physical or functional properties of the expressed 
polypeptide in in vitro assay systems, e.g., binding with antibody, promotion of cell proliferation. 

Once a particular recombinant DNA molecule is identified and isolated, several 
methods known in the art may be used to propagate it. The clones identified may be introduced into 
15 an appropriate host cell by standard methods, such as for example lipofection, electroporation, and 
heat shock. Once a suitable host system and growth conditions are established, recombinant 
expression vectors can be propagated and prepared in quantity. 

The invention also encompasses the host cells transformed by a vector according to 
the invention. These cells may be obtained by introducing into host cells a nucleotide sequence 
20 inserted into a vector as defined above, and then culturing the said cells under conditions allowing the 
replication and/or the expression of the transfected nucleotide sequence. 

The host cell may be chosen from eukaryotic or prokaryotic systems, such as for 
example bacterial cells (Olins and Lee, 1993), but also yeast cells (Buckholz, 1993), as well as animal 
cells, in particular cultures of mammalian cells (Edwards and Aruffo, 1993), and in particular Chinese 
25 hamster ovary (CHO) cells, but also insect cells in which methods using baculoviruses for example 
may be used (Luckow, 1993). 

Furthermore, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. 
Expression from certain promoters can be elevated in the presence of certain inducers; thus, 
30 expression of the genetically engineered polypeptide may be controlled. Furthermore, different host 
cells have characteristic and specific mechanisms for the translational and post-translational 
processing and modification (e.g., glycosylation, phosphorylation) of proteins. Appropriate cell lines 
or host systems can be chosen to ensure the desired modification and processing of the foreign protein 
expressed. For example, expression in a bacterial system can be used to produce an unglycosylated 
35 core protein product. Expression in yeast will produce a glycosylated product. Expression in 
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mammalian cells can be used to ensure "native" glycosylation of a heterologous protein. Furthermore, 
different vector/host expression systems may effect processing reactions to different extents. 

A preferred host cell for the expression of the proteins of the invention consists of 
prokaryotic cells, such as Gram negative bacteria. 
5 A further preferred host cell according to the invention is a bacterium belonging to 

the Chlamydia family, more preferably belonging to the species Chlamydia trachomatis or chosen 
from a microorganism associated with the species Chlamydia trachomatis. 

In other specific embodiments, the polypeptides, peptides or derivatives, or analogs 
thereof may be expressed as a fusion, or chimeric protein product (comprising the protein, fragment, 
10 analog, or derivative joined via a peptide bond to a heterologous protein sequence (of a different 
protein)). Such a chimeric product can be made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other by methods known in the art, in the proper 
coding frame, and expressing the chimeric product by methods commonly known in the art. 
Alternatively, such a chimeric product may be made by protein synthetic techniques, e.g., by use of a 
15 peptide synthesizer. 

Genomic sequences can be cloned and expressed as translation^ gene products 
peptides, polypeptides, and proteins) or transcriptional gene products antisense and ribozymes). 

The invention further relates to the intracellular production of an antisense nucleic 
acid sequence of SEQ ID No. 1 by transcription from an exogenous sequence. For example, a vector 

20 can be introduced in vivo such that it is taken up by a cell, within which cell the vector or a portion 
thereof is transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector 
would contain a sequence encoding an antisense nucleic acid. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the desired antisense 
RNA. Such vectors can be constructed by recombinant DNA technology methods standard in the art. 

25 Vectors can be plasmid, viral, or others known in the art, used for replication and expression in 
mammalian cells. Expression of the sequence encoding the antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human, cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: the CMV promoter, the SV40 early 
promoter region (Bemoist and Chambon. 1981, Nature 290:304-310), the promoter contained in the 

30 3n long terminal repeat of Rous sarcoma virus (Yamamoto et ai, 1980, Cell 22:787-797), the herpes 
thymidine kinase promoter (Wagner et ai, 1981, Proc. Natl. Acad. Sci. U.S.A. 78j 1441 -1445), the 
regulatory sequences of the metallothionein gene (Brinster et ai, 1982, Nature 226:39-42), etc. 

In a specific embodiment, the antisense oligonucleotide comprises catalytic RNA, or 
a ribozyme (see, e.g., PCT International Publication WO 90/1 1364, published October 4, 1990; Sarver 

35 et ai, 1990, Science 247:1222-1225). In another embodiment, the oligonucleotide is a 2 N -0- 
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methylribonucleotide (Inoue et ai, 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 
analog (Inoue et aL. 1987, FEBS Lett. 215:327-330). 

In another embodiment, the antisense nucleic acids of the invention "comprise a 
sequence complementary to at least a portion of an RNA transcript of a polynucleotide sequence in 
5 SEQ ID No, 1. However, absolute complementarity, although preferred, is not required. A sequence 
"complementary to at least a portion of an RNA," as referred to herein, means a sequence having 
sufficient complementarity to be able to hybridize with the RNA, forming a stable duplex; in the case 
of double-stranded antisense nucleic acid sequence, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to hybridize will depend on both the degree 
10 of complementarity and the length of the antisense nucleic acid. Generally, the longer the hybridizing 
nucleic acid, the more base mismatches with an RNA transcribed from SEQ ID No. 1 may contain and 
still form a stable duplex (or triplex, as the case may be). One skilled in the art can ascertain a 
tolerable degree of mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 

15 The invention also relates to the animals, except humans, comprising one of the 

above-described transformed cells according to the invention. 

The production of transgenic animals according to the invention overexpressing one 
or more of the Chlamydia trachomatis genes will be preferably carried out on rats, mice or rabbits 
according to methods well known to persons skilled in the art such as viral or nonviral transfections. 

20 The transgenic animals overexpressing one or more of the said genes may be obtained by transfection 
of multiple copies of the said genes under the control of a powerful promoter of a ubiquitous nature, 
or which is selective for one type of tissue. The transgenic animals may also be obtained by 
homologous recombination on embryonic stem cells, transfer of these stem cells to embryos, selection 
of the chimeras affected at the level of the reproductive lines, and growth of the said chimeras. 

25 The transformed cells as well as the transgenic animals according to the invention can 

be used in methods of preparing the recombinant polypeptide. 

It is now possible to produce recombinant polypeptides in a relatively large quantity 
by genetic engineering using the cells transformed with expression vectors according to the invention 
or using transgenic animals according to the invention. 

30 The methods of preparing a polypeptide of the invention in recombinant form, 

characterized in that they use a vector and/or a cell transformed with a vector according to the 
invention and/or a transgenic animal comprising one of the said transformed cells according to the 
invention, are themselves included in the present invention. 

Among the said methods of preparing a polypeptide of the invention in recombinant 

35 form, the methods of preparation using a vector, and/or a cell transformed with the said vector and/or 
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a transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a polypeptide of the cellular envelope of Chlamydia trachomatis or one of its representative 
fragments, more preferably encoding a polypeptide of the outer cellular envelope of Chlamydia 
trachomatis or one of its fragment, are preferred. 
5 Among the said methods of preparing a polypeptide of the invention in recombinant 

form, the methods of preparation using a vector, and/or a cell transformed with the said vector and/or 
a transgenic animal comprising one of the said transformed cells, containing a nucleotide sequence 
encoding a Chlamydia trachomatis secreted polypeptide or one of its representative fragments, or 
encoding a transport polypeptide, a surface exposed polypeptide, a lipoprotein or one of its 
10 representative fragments, a polypeptide involved in lipopolysaccharide biosynthesis, a Type IE or 
other secreted polypeptide, a polypeptide containing RGD attachment sites, a cell wall anchored 
surface polypeptide, a polypeptide not found in Chlamydia pneumoniae, a ribosomal polypeptide or a 
polypeptide involved in secretion, transcription, translation, maturation of proteins, a polypeptide 
involved in the synthesis of the wall, a polypeptide involved in the virulence, a polypeptide involved 
15 in the intermediate metabolism, in particular in the metabolism of sugars and/or of cofactors, a 
polypeptide involved in the metabolism of nucleotides, of amino acids, of nucleic acids or of fatty 
acids of Chlamydia trachomatis or one of their representative fragments, or a polypeptide specific to 
Chlamydiae, are also preferred. 

The recombinant polypeptides obtained as indicated above may be provided either in 
20 glycosylated or nonglycosylated form and may or may not have the natural tertiary structure. 

A preferred variant consists in producing a recombinant polypeptide fused to a 
«carrier» protein (chimeric protein). The advantage of this system is that it allows a stabilization and 
a reduction in proteolysis of the recombinant product, an increase in solubility during renaturation 
in vitro and/or a simplification of purification when the fusion partner has affinity for a specific 
25 ligand. 

More particularly, the invention relates to a method of preparing a polypeptide of the 
invention comprising the following steps: 

a) culture of the transformed cells under conditions allowing the expression of a recombinant 
polypeptide having a nucleic acid sequence according to the invention; 
30 b) where appropriate, recovery of the said recombinant polypeptide. 

When the method of preparing a polypeptide of the invention uses a transgenic 
animal according to the invention, the recombinant polypeptide is then extracted from the said animal. 

The subject of the invention is also a polypeptide capable of being obtained by a 
method of the invention as described above. 
35 The invention also comprises a method of preparing a synthetic polypeptide, 
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characterized in that it uses an amino acid sequence of polypeptides according to the invention. 

The invention also relates to a synthetic polypeptide obtained by a method according 

to the invention. 

Polypeptides according to the invention may also be prepared by conventional 
5 techniques in the field of peptide synthesis under conditions suitable to produce the polypeptides 
encoded by the polynucleotide of the invention. This synthesis may be carried out in and recovered 
from a homogeneous solution or on a solid phase. 

For example, the synthesis technique in a homogeneous solution described by 
Houbenweyl in 1974 may be used. 
10 This method of synthesis consists in successively condensing, in pairs, the successive 

amino acids in the required order, or in condensing amino acids and fragments previously formed and 
already containing several amino acids in the appropriate order, or alternatively several fragments 
thus previously prepared, it being understood that care will have been taken to protect beforehand all 
the reactive functional groups carried by these amino acids or fragments, with the exception of the 
15 amine functional groups of one and the carboxyl functional groups of the other or vice versa, which 
should normally take part in the formation of the peptide bonds, in particular after activation of the 
carboxyl functional group, according to methods well known in peptide synthesis. 

According to another preferred technique of the invention, the one described by 
Merrifield is used. 

20 To manufacture a peptide chain according to the Merrifield method, a highly porous 

polymer resin is used, onto which the first C-terminal amino acid of the chain is attached. This amino 
acid is attached onto a resin via its carboxyl group and its amine functional group is protected. The 
amino acids which will constitute the peptide chain are thus attached, one after another, onto the 
amine group, each time deprotected beforehand, of the portion of the peptide chain already formed, 

25 and which is attached to the resin. When the entire peptide chain desired is formed, the protecting 
groups are removed from the various amino acids constituting the peptide chain and the peptide is 
detached from the resin with the aid of an acid. 

The invention relates, in addition, to hybrid (fusion) polypeptides having at least one 
polypeptide or one of its representative fragments according to the invention, and a sequence of a 

30 polypeptide capable of eliciting an immune response in humans or animals. 

Advantageously, the antigenic determinant is such that it is capable of eliciting a 
humoral and/or cellular response. 

An antigenic determinant may be identified by screening expression libraries of the 
Chlamydia trachomatis genome with antibodies contained in the serum of patients infected with a 

35 bacterium belonging to the species Chlamydia trachomatis. An antigenic determinant may comprise a 
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polypeptide or one of its fragments according to the invention, in glycosylated form, used in order to 
obtain immunogenic compositions capable of inducing the synthesis of antibodies directed against 
multiple epitopes. The said polypeptides or their glycosylated fragments also form" part of the 
invention, 

5 These hybrid molecules may consist, in part, of a carrier molecule for polypeptides or 

for their representative fragments according to the invention, combined with a portion which may be 
immunogenic, in particular an epitope of the diphtheria toxin, the tetanus toxin, a hepatitis B virus 
surface antigen (patent FR 79 21811), the poliomyelitis vims VP1 antigen or any other viral or 
bacterial toxin or antigen. 

10 The methods of synthesizing the hybrid molecules include the methods used in 

genetic engineering to construct hybrid nucleotide sequences encoding the desired polypeptide 
sequences. Reference may be advantageously made, for example, to the technique for producing 
genes encoding fusion proteins described by Minton in 1984. 

The said hybrid nucleotide sequences encoding a hybrid polypeptide as well as the 

15 hybrid polypeptides according to the invention, characterized in that they are recombinant 
polypeptides obtained by the expression of the said hybrid nucleotide sequences, also form part of the 
invention. 

The invention also comprises the vectors characterized in that they contain one of the 
said hybrid nucleotide sequences. The host cells transformed by the said vectors, the transgenic 

20 animals comprising one of the said transformed cells as well as the methods of preparing recombinant 
polypeptides using the said vectors, the said transformed cells and/or the said transgenic animals of 
course also form part of the invention. 

The polypeptides according to the invention, the antibodies according to the invention 
described below and the nucleotide sequences according to the invention may advantageously be used 

25 in in vitro and/or in vivo methods for the detection and/or the identification of bacteria belonging to 
the species Chlamydia trachomatis, in a biological sample (biological tissue or fluid) which is likely 
to contain them. These methods, depending on the specificity of the polypeptides, of the antibodies 
and of the nucleotide sequences according to the invention which will be used, may in particular 
detect and/or identify the bacterial variants belonging to the species Chlamydia trachomatis as well as 

30 the associated microorganisms capable of being detected by the polypeptides, the antibodies and the 
nucleotide sequences according to the invention which will be chosen. It may, for example, be 
advantageous to choose a polypeptide, an antibody or a nucleotide sequence according to the 
invention, which is capable of detecting any bacterium of the Chlamydia family by choosing a 
polypeptide, an antibody and/or a nucleotide sequence according to the invention which is specific to 

35 the family or, on the contrary, it will be most particularly advantageous to target a variant of the 
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species Chlamydia trachomatis, which is responsible, for example, for the induction or the worsening 
of pathologies specific to the targeted variant, by choosing a polypeptide, an antibody and/or a 
nucleotide sequence according to the invention which is specific to the said variant. 

The polypeptides according to the invention may advantageously be used in a method 
5 for the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis 
or to an associated microorganism, in a biological sample (biological tissue or fluid) which is likely to 
contain them, characterized in that it comprises the following steps: 

a) bringing this biological sample into contact with a polypeptide or one of its representative 
fragments according to the invention (under conditions allowing an immunological reaction between 

10 the said polypeptide and the antibodies which may be present in the biological sample); 

b) detecting the antigen-antibody complexes which may be formed. 

Preferably, the biological sample consists of a fluid, for example a human or animal 
serum, blood or biopsies. 

Any conventional procedure may be used to carry out such a detection of the antigen- 
1 5 antibody complexes which may be formed. 

By way of example, a preferred method uses immunoenzymatic procedures based on 
the ELBA technique, immunofluorescence procedures or radioimmunological procedures (RIA), and 
the like. 

Accordingly, the invention also relates to the polypeptides according to the invention, 
20 labelled with the aid of a suitable label such as a label of the enzymatic, fluorescent or radioactive 
type. 

Such methods comprise, for example, the following steps: 
deposition of defined quantities of a polypeptide composition according to the invention into 
the wells of a microtitre plate, 
25 - introduction, into the said wells, of increasing dilutions of serum, or of a different biological 
sample as defined above, which has to be analysed, 
incubation of the microplate, 

introduction, into the wells of the microtitre plate, of labelled antibodies directed against 
human or animal immunoglobulins, these antibodies having been labelled with the aid of an enzyme 
30 selected from those which are capable of hydrolyzing a substrate, thereby modifying the absorption of 
the radiation of the latter, at least at a defined wavelength, for example at 550 nm, 

detection, by comparison with a control, of the quantity of substrate hydrolyzed. 

The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
35 characterized in that it comprises the following components: 
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a polypeptide according to the invention, 

where appropriate, the reagents for constituting the medium appropriate for the 
immunological or specific reaction, 

the reagents allowing the detection of the antigen-antibody complexes produced by the 
5 immunological reaction between the polypeptide(s) of the invention and the antibodies which may be 
present in the biological sample, it being possible for these reagents also to carry a label, or to be 
capable of being recognized in turn by a labelled reagent, more particularly in the case where the 
polypeptide according to the invention is not labelled, 

where appropriate, a reference biological sample (negative control) free of antibodies 
10 recognized by a polypeptide according to the invention, 

where appropriate, a reference biological sample (positive control) containing a 
predetermined quantity of antibodies recognized by a polypeptide according to the invention. 

According to the invention, the polypeptides, peptides, fusion proteins or other 
derivatives, or analogs thereof encoded by a polynucleotide sequence in SEQ ID No. 1, may be used 
15 as an immunogen to generate antibodies which immunospecifically bind such an immunogen. Such 
antibodies may include, but are not limited to, polyclonal and monoclonal antibodies, humanized or 
chimeric antibodies, single chain antibodies, Fab fragments, F(ab')2 fragments, fragments produced by 
a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of 
the above. In a specific embodiment, the antibody to a polypeptide, peptide or other derivative, or 
20 analog thereof encoded by a polynucleotide sequence in SEQ ID No. 1 is a bispecific antibody (see 
generally, e.g. Fanger and Drakeman, 1995, Drug News and Perspectives 8: 133-137). Such a 
bispecific antibody is genetically engineered to recognize both (1) an epitope and (2) one of a variety 
of "trigger" molecules, e.g. Fc receptors on myeloid cells, and CD3 and CD2 on T cells, that have 
been identified as being able to cause a cytotoxic T-cell to destroy a particular target. Such bispecific 
25 antibodies can be prepared either by chemical conjugation, hybridoma, or recombinant molecular 
biology techniques known to the skilled artisan. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to a polypeptide, peptide or other derivative, or analog thereof encoded by a polynucleotide 
sequence in SEQ ID No. L For the production of antibody, various host animals can be immunized 
30 by injection with a polypeptide, or peptide or other derivative, or analog thereof, including but not 
limited to rabbits, mice, rats, etc. Various adjuvants, depending on the host species, may be used to 
increase the immunological response, including but not limited to Stimulon™ QS-21 (Aquila 
Biopharmaceuticals, Inc., Framingham, MA), MPL™ (3-O-deacylated monophosphoryl lipid A; RIBI 
ImmunoChem Research, Inc., Hamilton, MT), aluminum phosphate, IL-12 (Genetics Institute, 
35 Cambridge, MA), Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, 
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surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanins, dinitrophenol, BCG (bacille Calmette-Guerin), and corynebacterium 
parvum. Alternatively, polyclonal antibodies may be prepared by purifying, on an affinity column 
onto which a polypeptide according to the invention has been previously attached, the antibodies 
5 contained in the serum of patients infected with a bacterium belonging to the species Chlamydia 
trachomatis. 

For preparation of monoclonal antibodies directed toward a polypeptide, peptide or 
other derivative, or analog, any technique which provides for the production of antibody molecules by 
continuous cell lines in culture may be used. For example, the hybridoma technique originally 

10 developed by Kohler and Milstein (1975, Nature 256:495^97), as well as the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al y 1983, Immunology Today 4:72), and the EBV- 
hybridorna technique to produce human monoclonal antibodies (Cole et a/., 1985, in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ-free animals utilizing technology 

15 described in PCT/US90/02545. In another embodiment of the invention, transgenic non-human 
animals can be used for the production of human antibodies utilizing technology described in WO 
98/24893 and WO 96/33735. According to the invention, human antibodies may be used and can be 
obtained by using human hybridomas (Cote et al, 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) 
or by transforming human B cells with EBV virus in vitro (Cole et al, 1985, in Monoclonal 

20 Antibodies and Cancer Therapy. Alan R. Liss, pp. 77-96). In fact, according to the invention, 
techniques developed for the production of ^chimeric antibodies^, (Morrison et al, 1984, PROC. 
NATL. ACAD. SCI U.S.A. 81:6851-6855; Neuberger et aL, 1984, Nature 312:604-608: Takeda et 
al % 1985, Nature 114:452-454) by splicing the genes from a mouse antibody molecule specific for a 
polypeptide, peptide or other derivative, or analog together with genes from a human antibody 

25 molecule of appropriate biological activity can be used; such antibodies are within the scope of this 
invention. 

According to the invention, techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce polypeptide or peptide-specific single 
chain antibodies. An additional embodiment of the invention utilizes the techniques described for the 
30 construction of Fab expression libraries (Huse et al, 1989, Science 246:1275-1281) to allow rapid 
and easy identification of monoclonal Fab fragments with the desired specificity for polypeptides, 
derivatives, or analogs. 

Antibody fragments which contain the idiotype of the molecule can be generated by 
known techniques. For example, such fragments include but are not limited to: the F(ab*)2 fragment 
35 which can be produced by pepsin digestion of the antibody molecule; the Fab' fragments which can be 
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generated by reducing the disulfide bridges of the F(ab*) 2 fragment, the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent, and Fv fragments. 

In addition, techniques have been developed for the production of chimerized (See, e.g. . 
Boss, M. et al., U.S. Patent No. 4,816,397; and Cabilly, S. et al., U.S. Patent No. 5,585,089 each of 
5 which is incorporated herein by reference in its entirety) humanized antibodies (See, e.g., Queen, U.S. 
Patent No. 5,585,089, which is incorporated herein by reference in its entirety.) An immunoglobulin 
light or heavy chain variable region consists of a "framework" region interrupted by three 
hypervariable regions, referred to as complementarily determining regions (CDRs). The extent of the 
framework region and CDRs have been precisely defined (See, "Sequences of Proteins of 

10 Immunological Interest", Kabat, E. et al., U.S. Department of Health and Human Services (1983)). 
Briefly, humanized antibodies are antibody molecules from non-human species having one or more 
CDRs from the non-human species and a framework from a human immunoglobulin molecule. 

The antibodies of the invention may also be labelled in the same manner as described 
above for the nucleic probes of the invention such as an enzymatic, fluorescent or radioactive type 

15 labelling. 

The invention relates, in addition, to a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism in a biological sample, characterized in that it comprises the following steps: 

a) bringing the biological sample (biological tissue or fluid) into contact with a mono- or 
20 polyclonal antibody according to the invention (under conditions allowing an immunological reaction 

between the said antibodies and the polypeptides of the bacterium belonging to the species Chlamydia 
trachomatis or to an associated microorganism which may be present in the biological sample, that is, 
under conditions suitable for the formation of immune complexes); 

b) detecting the antigen-antibody complex which may be formed. 

25 Also falling within the scope of the invention is a kit or set for the detection and/or 

the identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism, characterized in that it comprises the following components: 

a polyclonal or monoclonal antibody according to the invention, labelled where appropriate; 
where appropriate, a reagent for constituting the medium appropriate for carrying out the 
30 immunological reaction; 

a reagent allowing the detection of the antigen-antibody complexes produced by the 
immunological reaction, it being possible for this reagent also to carry a label, or to be capable of 
being recognized in turn by a labelled reagent, more particularly in the case where the said 
monoclonal or polyclonal antibody is not labelled; 
35 - where appropriate, reagents for carrying out the lysis of the cells in the sample tested. 
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The principle of the DNA chip which was explained above may also be used to 
produce protein «chips» on which the support has been coated with a polypeptide or an antibody 
according to the invention, or arrays thereof, in place of the DNA. These protein «ctiips» make it 
possible, for example, to analyse the biomolecular interactions (BIA) induced by the affinity capture 
5 of target analytes onto a support coated, for example, with proteins, by surface plasma resonance 
(SPR). Reference may be made, for example, to the techniques for coupling proteins onto a solid 
support which are described in EP 524 800 or to the methods describing the use of biosensor-type 
protein chips such as the BIAcore-type technique (Pharmacia) (Arlinghaus eta]., 1997, Krone et al., 
1997, Chatelier et al., 1995). These polypeptides or antibodies according to the invention, capable of 

10 specifically binding antibodies or polypeptides derived from the sample to be analysed, may thus be 
used in protein chips for the detection and/or the identification of proteins in samples. The said 
protein chips may in particular be used for infectious diagnosis and may preferably contain, per chip, 
several polypeptides and/or antibodies of the invention of different specificity, and/or polypeptides 
and/or antibodies capable of recognizing microorganisms different from Chlamydia trachomatis, 

15 Accordingly, the subject of the present invention is also the polypeptides and the 

antibodies according to the invention, characterized in that they are immobilized on a support, in 
particular of a protein chip. 

The protein chips, characterized in that they contain at least one polypeptide or one 
antibody according to the invention immobilized on the support of the said chip, also form part of the 

20 invention. 

The invention comprises, in addition, a protein chip according to the invention, 
characterized in that it contains, in addition, at least one polypeptide of a microorganism different 
from Chlamydia trachomatis or at least one antibody directed against a compound of a microorganism 
different from Chlamydia trachomatis, immobilized on the support of the said chip. 

25 The invention also relates to a kit or set for the detection and/or the identification of 

bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, or for the 
detection and/or the identification of a microorganism characterized in that it comprises a protein chip 
according to the invention. 

The subject of the present invention is also a method for the detection and/or the 

30 identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism in a biological sample, characterized in that it uses a nucleotide sequence according to 
the invention. 

More particularly, the invention relates to a method for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
35 microorganism in a biological sample, characterized in that it comprises the following steps: 
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a) where appropriate, isolation of the DNA from the biological sample to be analysed, or 
optionally production of a cDNA from the RNA in the biological sample; 

b) specific amplification of the DNA of bacteria belonging to the species Chlamydia trachomatis 
or to an associated microorganism with the aid of at least one primer according to the invention; 

5 c) detection of the amplification products. 

These may be detected, for example, by the molecular hybridization technique using a 
nucleic probe according to the invention. This probe will be advantageously labelled with a 
nonradioactive (cold probe) or radioactive element. 

For the purposes of the present invention, «DNA in the biological sarnple» or «DNA 
10 contained in the biological sample» will be understood to mean either the DNA present in the 
biological sample considered, or optionally the cDNA obtained after the action of a reverse 
transcriptase-type enzyme on the RNA present in the said biological sample. 

Another aim of the present invention consists in a method according to the invention, 
characterized in that it comprises the following steps: 
15 a) bringing a nucleotide probe according to the invention into contact with a biological sample, 
the DNA contained in the biological sample having, where appropriate, been previously made 
accessible to hybridization, under conditions allowing the hybridization of the probe to 
complementary base pairs of the DNA of a bacterium belonging to the species Chlamydia trachomatis 
or to an associated microorganism; 
20 b) detecting the hybridization complex formed between the nucleotide probe and the DNA in the 
biological sample. 

The present invention also relates to a method according to the invention, 
characterized in that it comprises the following steps: 

a) bringing a nucleotide probe immobilized on a support according to the invention into contact 
25 with a biological sample, the DNA in the sample having, where appropriate, been previously made 

accessible to hybridization, under conditions allowing the hybridization of the probe to the DNA of a 
bacterium belonging to the species Chlamydia trachomatis or to an associated microorganism; 

b) bringing the hybrid formed between the nucleotide probe immobilized on a support and the 
DNA contained in the biological sample, where appropriate after removal of the DNA in the 

30 biological sample which has not hybridized with the probe, into contact with a labelled nucleotide 
probe according to the invention; 

c) detecting the new hybrid formed in step b). 

According to an advantageous embodiment of the method for the detection and/or the 
identification defined above, it is characterized in that, prior to step a), the DNA in the biological 
35 sample is primer-extended and/or amplified beforehand with the aid of at least one primer according 
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to the invention. 

The invention relates, in addition, to a kit or set for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 
microorganism, characterized in that it comprises the following components: 
5 a) a nucleotide probe according to the invention; 

b) where appropriate, the reagents necessary for carrying out a hybridization reaction; 

c) where appropriate, at least one primer according to the invention as well as the reagents (e.g., 
polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification reaction. 

The invention also relates to a kit or set for the detection and/or the identification of 
10 bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
characterized in that it comprises the following components: 

a) a nucleotide probe, called capture probe, according to the invention; 

b) an oligonucleotide probe, called detection probe, according to the invention; 

c) where appropriate, at least one primer according to the invention as well as the reagents (e.g., 
1 5 polymerase and/or deoxynucleotide triphosphates) necessary for a DNA amplification reaction. 

The invention also relates to a kit or set for the detection and/or the identification of 
bacteria belonging to the species Chlamydia trachomatis or to an associated microorganism, 
characterized in that it comprises the following components: 
a) at least one primer according to the invention; 

20 b) where appropriate, the reagents necessary for carrying out a DNA amplification reaction; 

c) where appropriate, a component which makes it possible to check the sequence of the 
amplified fragment, more particularly an oligonucleotide probe according to the invention. 

The invention relates, in addition, to a kit or set for the detection and/or the 
identification of bacteria belonging to the species Chlamydia trachomatis or to an associated 

25 microorganism, or for the detection and/or the identification of a microorganism characterized in that 
it comprises a DNA chip according to the invention. 

The invention also relates to a method or to a kit or set according to the invention for 
the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis, 
characterized in that the said primer and/or the said probe according to the invention are chosen from 

30 the nucleotide sequences specific to the species Chlamydia trachomatis, in that the said polypeptides 
according to the invention are chosen from the polypeptides specific to the species Chlamydia 
trachomatis and in that the said antibodies according to the invention are chosen from the antibodies 
directed against the polypeptides according to the invention chosen from the polypeptides specific to 
the species Chlamydia trachomatis. 

35 Preferably, the said method or the said kit or set above according to the invention, for 
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the detection and/or the identification of bacteria belonging to the species Chlamydia trachomatis is 
characterized in that the said primer and/or the said probe or the said polypeptides are chosen from the 
nucleotide sequences or polypeptides according to the invention which have been identified as being 
specific to the species Chlamydia trachomatis and in that the said antibodies according to the 
5 invention are chosen from the antibodies directed against the polypeptides according to the invention 
chosen from the polypeptides identified as being specific to the species Chlamydia trachomatis. 

The invention relates, in addition, to a method or a kit or set according to the 
invention for the diagnosis of predispositions to, or of a condition caused by, genital diseases which 
are induced or worsened by a Chlamydia trachomatis infection. 
10 The invention also relates to a method or a kit or set according to the invention for the 

diagnosis of predispositions to, or of conditions caused by, eye diseases induced or worsened by a 
Chlamydia trachomatis infection. 

The invention also relates to a method or a kit or set according to the invention for the 
diagnosis of predispositions to, or of conditions caused by, systemic diseases, in particular of the 
15 lymphatic system, which are induced or worsened by a Chlamydia trachomatis infection. 

According to another aspect, the subject of the invention is the use of polypeptides 
according to the invention, of cells transformed with a vector according to the invention and/or of 
transformed animals according to the invention, for the biosynthesis or the biodegradation of organic 
or inorganic compounds. 

20 As has been mentioned above, the nucleotide sequences of the invention were 

identified by homology with sequences known to encode, for example, polypeptides or fragments of 
enzymatic polypeptides involved in the biosynthesis or the biodegradation of organic or inorganic 
molecules. 

It is thus possible to use the said polypeptides of the invention in a similar manner for 
25 the biosynthesis or the biodegradation of organic or inorganic compounds of industrial or therapeutic 
interest (called compounds of interest). 

Among these polypeptides, there may be mentioned in particular the enzymes 
involved in metabolism, such as the proteolytic enzymes, amino transferases, glucose metabolism, or 
the enzymes which may be used in the biosynthesis of sugars, amino acids, fatty acids, polypeptides, 
30 nucleotides, nucleic acids or any other organic or inorganic compound or in the biodegradation of 
organic or inorganic compounds. 

Among these polypeptides, there may be mentioned, in addition, the mutated or 
modified enzymes corresponding to mutated or modified polypeptides according to the invention 
which may also be used for the biosynthesis or the biodegradation of organic or inorganic compounds 
35 at the industrial level, such as, for example, the production of compounds of interest, the reprocessing 
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of manufacturing residues applied to the food industries, to the papermaking industry or to the 
chemical and pharmaceutical industries. 

The methods of biosynthesis or biodegradation of organic or inorganic compounds, 
characterized in that they use a polypeptide or one of its representative fragments according to the 
5 invention, transformed cells according to the invention and/or a transformed animal according to the 
invention, also form part of the invention. 

The invention relates, in addition, to the use of a nucleotide sequence according to the 
invention, of a polypeptide according to the invention, of an antibody according to the invention, of a 
cell according to the invention, and/or of a transformed animal according to the invention, for the 

10 selection of an organic or inorganic compound capable of modulating, regulating, inducing or 
inhibiting the expression of genes, and/or of modifying the cellular replication of eukaryotic or 
prokaryotic cells or capable of inducing, inhibiting or worsening the pathologies linked to an infection 
by Chlamydia trachomatis or one of its associated microorganisms. 

The invention also comprises screening assays that comprise method of selecting 

15 compounds capable of binding to a polypeptide, fusion polypeptide, or one of its representative 
fragments according to the invention, capable of binding to a nucleotide sequence according to the 
invention, or capable of recognizing an antibody according to the invention, and/or capable of 
modulating, regulating, inducing or inhibiting the expression of genes, and/or of modifying the growth 
or the cellular replication of eukaryotic or prokaryotic cells, or capable of inducing, inhibiting or 

20 worsening, in an animal or human organism, the pathologies linked to an infection by Chlamydia 
trachomatis or one of its associated microorganisms, characterized in that it comprises the following 
steps: 

a) bringing the said compound into contact with the said polypeptide, the said 
nucleotide sequence, with a transformed cell according to the invention and/or administering the said 

25 compound to a transformed animal according to the invention; 

b) determining the capacity of the said compound to bind with the said 
polypeptide or the said nucleotide sequence, or to modulate, regulate, induce or inhibit the expression 
of genes, or to modulate growth or cellular replication, or to induce, inhibit or worsen in the said 
transformed animal, the pathologies linked to an infection by Chlamydia trachomatis or one of its 

30 associated microorganisms. 

The transformed cells and/or animals according to the invention may advantageously 
serve as a model and may be used in methods for studying, identifying and/or selecting compounds 
capable of being responsible for pathologies induced or worsened by Chlamydia trachomatis, or 
capable of preventing and/or of treating these pathologies such as, for example, genital, eye or 

35 systemic diseases, especially of the lymphatic system. In particular, the transformed host cells, in 
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particular bacteria of the Chlamydia family whose transformation with a vector according to the 
invention may, for example, increase or inhibit its infectivity, or modulate the pathologies usually 
induced or worsened by the infection, may be used to infect animals in which the onset of pathologies 
will be monitored. These nontransformed animals, infected for example with transformed Chlamydia 
5 bacteria, may serve as a study model. In the same manner, the transformed animals according to the 
invention may, for example, exhibit predispositions to genital and/or eye and/or systemic diseases, 
especially of the lymphatic system, and thus be used in methods for selecting compounds capable of 
preventing and/or of treating the said diseases. The said methods using the said transformed cells 
and/or transformed animals form part of the invention. 

10 The compounds capable of being selected may be organic compounds such as 

polypeptides or carbohydrates or any other organic or inorganic compounds already known, or new 
organic compounds produced using molecular modelling techniques and obtained by chemical or 
biochemical synthesis, these techniques being known to persons skilled in the art. 

The said selected compounds may be used to modulate the growth and/or the cellular 

15 replication of Chlamydia trachomatis or any other associated microorganism and thus to control 
infection by these microorganisms. The said compounds according to the invention may also be used 
to modulate the growth and/or the cellular replication of all eukaryotic or prokaryotic cells, in 
particular tumour cells and infectious microorganisms, for which the said compounds will prove 
active, the methods which make it possible to determine the said modulations being well known to 

20 persons skilled in the art. 

Compound capable of modulating the growth of a microorganism is understood to 
designate any compound which makes it possible to act, to modify, to limit and/or to reduce the 
development, the growth, the rate of proliferation and/or the viability of the said microorganism. 

This modulation may be achieved, for example, by an agent capable of binding to a 

25 protein and thus of inhibiting or of potentiating its biological activity, or capable of binding to a 
membrane protein of the outer surface of a microorganism and of blocking the penetration of the said 
microorganism into the host cell or of promoting the action of the immune system of the infected 
organism directed against the said microorganism. This modulation may also be achieved by an agent 
capable of binding to a nucleotide sequence of a DNA or RNA of a microorganism and of blocking, 

30 for example, the expression of a polypeptide whose biological or structural activity is necessary for 
the growth or for the reproduction of the said microorganism. 

Associated microorganism is understood to designate in the present invention any 
microorganism whose gene expression may be modulated, regulated, induced or inhibited, or whose 
growth or cellular replication may also be modulated by a compound of the invention. Associated 

35 microorganism is also understood to designate in the present invention any microorganism containing 
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nucleotide sequences or polypeptides according to the invention. These microorganisms may, in some 
cases, contain polypeptides or nucleotide sequences identical or homologous to those of the invention 
may also be detected and/or identified by the detection and/or identification methods of kit according 
to the invention and may also serve as a target for the compounds of the invention. 
5 The invention relates to the compounds capable of being selected by a method of 

selection according to the invention. 

The invention also relates to a pharmaceutical composition comprising a compound 
chosen from the following compounds: 
a nucleotide sequence according to the invention; 
10 a polypeptide or fusion polypeptide according to the invention; 
a vector according to the invention; 
an antibody according to the invention; and 

a compound capable of being selected by a method of selection according to the invention, optionally 
in combination with a pharmaceutically acceptable vehicle or carrier. 
15 An effective quantity is understood to designate a sufficient quantity of the said 

compound or antibody, or of a polypeptide of the invention, which makes it possible to modulate the 
growth of Chlamydia trachomatis or of an associated microorganism. 

The invention also relates to a pharmaceutical composition according to the invention 
for the prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
20 trachomatis or by an associated microorganism. 

The invention relates, in addition, to an immunogenic and/or vaccine composition, 
characterized in that it comprises one or more polypeptides according to the invention and/or one or 
more hybrid polypeptides according to the invention. 

The invention also comprises the use of a transformed cell according to the invention, 
25 for the preparation of a vaccine composition. 

The invention also relates to a vaccine composition, characterized in that it contains a 
nucleotide sequence according to the invention, a vector according to the invention and/or a 
transformed cell according to the invention. 

The invention also relates to the vaccine compositions according to the invention, for 
30 the prevention or the treatment of an infection by a bacterium belonging to the species Chlamydia 
trachomatis or by an associated microorganism. 

The invention also relates to the use of DNA encoding polypeptides of Chlamydia 
trachomatis, in particular antigenic determinants, to be formulated as vaccine compositions. In 
accordance with this aspect of the invention, the DNA of interest is engineered into an expression 
35 vector under the control of regulatory elements, which will promote expression of the DNA, i.e„ 
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promoter or enhancer elements. In one preferred embodiment, the promoter element may be cell- 
specific and permit substantial transcription of the DNA only in predetermined cells. The DNA may 
be introduced directly into the host either as naked DNA (U.S. Patent No. 5,679,647 'incorporated 
herein by reference in their entirety) or formulated in compositions with other agents which may 
5 facilitate uptake of the DNA including viral vectors, i.e., adenovirus vectors, or agents which facilitate 
immunization, such as bupivicaine and other local anesthetics (U.S. Patent 5,593,972 incorporated 
herein by reference in their entirety), saponins (U.S. Patent 5,739,118 incorporated herein by 
reference in their entirety) and cationic polyamines (published international application WO 96/10038 
incorporated herein by reference in their entirety). 
10 The DNA sequence encoding the antigenic polypeptide and regulatory element may 

be inserted into a stable cell line or cloned microorganism, using techniques, such as targeted 
homologous recombination, which are well known to those of skill in the art, and described e.g., in 
Chappel, U.S. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is incorporated herein by 
reference in its entirety. 

15 Such cell lines and microorganisms may be formulated for vaccine purposes. In yet 

another embodiment, the DNA sequence encoding the antigenic polypeptide and regulatory element 
may be delivered to a mammalian host and introduced into the host genome via homologous 
recombination (See, Chappel, U.S. Patent No. 4,215,051; Skoultchi, WO 91/06667 each of which is 
incorporated herein by reference in its entirety. 

20 Preferably, the immunogenic and/or vaccine compositions according to the invention 

intended for the prevention and/or the treatment of an infection by Chlamydia trachomatis or by an 
associated microorganism will be chosen from the immunogenic and/or vaccine compositions 
comprising a polypeptide or one of its representative fragments corresponding to a protein, or one of 
its representative fragments, of the cellular envelope of Chlamydia trachomatis. The vaccine 

25 compositions comprising nucleotide sequences will also preferably comprise nucleotide sequences 
encoding a polypeptide or one of its fragments corresponding to a protein, or one of its representative 
fragments, of the cellular envelope of Chlamydia trachomatis. 

Among these preferred immunogenic and/or vaccine compositions, the most preferred 
are those comprising a polypeptide or one of its representative fragments, or a nucleotide sequence or 

30 one of its representative fragments whose sequences are chosen from the nucleotide or amino acid 
sequences identified in this functional group and listed above. 

The polypeptides of the invention or their representative fragments entering into the 
immunogenic compositions according to the invention may be selected by techniques known to 
persons skilled in the art, such as for example on the capacity of the said polypeptides to stimulate T 

35 cells, which results, for example, in their proliferation or the secretion of interleukins, and which 
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leads to the production of antibodies directed against the said polypeptides. 

In mice, in which a weight dose of the vaccine composition comparable to the dose 
used in humans is administered, the antibody reaction is tested by collecting serum followed by a 
study of the formation of a complex between the antibodies present in the serum and the antigen of 
5 the vaccine composition, according to the customary techniques. 

According to the invention, the said vaccine compositions will be preferably in 
combination with a pharmaceutically acceptable vehicle and, where appropriate, with one or more 
appropriate immunity adjuvants. 

Various types of vaccines are currently available for protecting humans against 
10 infectious diseases: attenuated live microorganisms (M. bovis - BCG for tuberculosis), inactivated 
microorganisms (influenza virus), acellular extracts (Bordetella pertussis for whooping cough), 
recombinant proteins (hepatitis B virus surface antigen), polysaccharides (pneumococci). Experiments 
are underway on vaccines prepared from synthetic peptides or from genetically modified 
microorganisms expressing heterologous antigens. Even more recently, recombinant plasmid DNAs 

15 carrying genes encoding protective antigens were proposed as an alternative vaccine strategy. This 
type of vaccination is carried out with a particular plasmid derived from an E. coli plasmid which 
does not replicate in vivo and which encodes only the vaccinal protein. Animals were immunized by 
simply injecting the naked plasmid DNA into the muscle. This technique leads to the expression of 
the vaccine protein in situ and to a cell-type (CTL) and a humoral type (antibody) immune response. 

20 This double induction of the immune response is one of the main advantages of the technique of 
vaccination with naked DNA. 

The vaccine compositions of the present invention can be evaluated in in vitro and in 
vivo animal models prior to host, e^, human, administration. For example, in vitro neutralization 
assays such as those described by Peterson et al. (1988) can be utilized. The assay described by 

25 Peterson et al. (1988) is suitable for testing vaccine compositions directed toward either Chlamydia 
trachomatis or Chlamydia pneumoniae. 

Briefly, hyper-immune antisera is diluted in PBS containing 5% guinea pig serum, as 
a complement source. Chlamydiae (10 4 IFU; inclusion forming units) are added to the antisera 
dilutions. The antigen-antibody mixtures are incubated at 37°C for 45 minutes and inoculated into 

30 duplicate confluent Hep-2 or HeLa cell monolayers contained in glass vials (e.g. . 15 by 45 mm), 
which have been washed twice with PBS prior to inoculation. The monolayer cells are infected by 
centrifugation at 1000X g for 1 hour followed by stationary incubation at 37° for 1 hour. Infected 
monolayers are incubated for 48 or 72 hours, fixed and stained with a Chlamydiae specific antibody, 
such as anti-MOMP for C trachomatis, etc. Inclusion-bearing cells are counted in ten fields at a 

35 magnification of 200X. Neutralization titer is assigned based on the dilution that gives 50% 
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inhibition as compared to control monoIayers/IFU. 

The efficacy of vaccine compositions can be determined in vivo by challenging 
animal models of Chlamydia trachomatis infection, e.g., guinea pigs or mice, with "the vaccine 
compositions. For example, in vivo vaccine composition challenge studies in the guinea pig model of 
5 Chlamydia trachomatis infection can be performed. Briefly, female guinea pigs weighing 450 to 500 
g are housed in an enviornmentally controlled room with a 12 hour light-dark cycle and immunized 
with vaccine compositions via a variety of immunization routes. Post-vaccination, guinea pigs are 
infected in the genital tract with the agent of guinea pig inclusion conjunctivitis (GPIC), which has 
been grown in HeLa or McCoy cells (Rank et al. (1988)). Each animal receives approximately 

10 1 .4x 10 7 inclusion forming units (IFU) contained in 0.05 ml of sucrose-phosphate-glutamate buffer, pH 
7.4 (Schacter, J. (1980)). The course of infection monitored by determining the percentage of 
inclusion-bearing cells by indirect immunofluorescence with GPIC specific antisera, or by Giemsa- 
stained smear from a scraping from the genital tract (Rank et al. (1988)). Antibody titers in the serum 
is determined by an enzyme-linked immunosorbent assay. 

15 Alternatively, in vivo vaccine composition challenge studies can be performed in the 

murine model of Chlamydia trachomatis (Morrison et al., 1995). Briefly, female mice 7 to 12 weeks 
of age receive 2.5 mg of depoprovera subcutaneously at 10 and 3 days before vaginal infection. Post- 
vaccination, mice are infected in the genital tract with 1,500 inclusion-forming units of Chlamydia 
trachomatis contained in 5ml of sucrose-phosphate-glutamate buffer, pH. 7.4. The course of infection 

20 is monitored by determining the percentage of inclusion-bearing cells by indirect immunofluorence 
with Chlamydia trachomatis specific antisera, or by a Giemsa-stained smear from a scraping from the 
genital tract of an infected mouse. The presence of antibody titers in the serum of a mouse is 
determined by an enzyme-linked immunosorbent assay. 

The vaccine compositions comprising nucleotide sequences or vectors into which the 

25 said sequences are inserted are in particular described in International Application No. WO 90/1 1092 
and also in International Application No. WO 95/1 1307. 

The nucleotide sequence constituting the vaccine composition according to the 
invention may be injected into the host after having been coupled to compounds which promote the 
penetration of this polynucleotide inside the cell or its transport up to the cell nucleus. The resulting 

30 conjugates may be encapsulated into polymeric microparticles, as described in International 
Application No. WO 94/27238 (Medisorb Technologies International). 

According to another embodiment of the vaccine composition according to the 
invention, the nucleotide sequence, preferably a DNA, is complexed with the DEAE-dextran (Pagano 
t al., 1967) or with nuclear proteins (Kaneda et al., 1989), with lipids (Feigner et al., 1987) or 

35 encapsulated into liposomes (Fraley et al., 1980) or alternatively introduced in the form of a gel 
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facilitating its transfection into the cells (Midoux et al., 1993, Pastore et ah, 1994). The 
polynucleotide or the vector according to the invention may also be in suspension in a buffer solution 
or may be combined with liposomes. 

Advantageously, such a vaccine will be prepared in accordance with the technique 
5 described by Tacson et al. or Huygen et al. in 1996 or alternatively in accordance with the technique 
described by Davis et al. in International Application No. WO 95/1 1307. 

Such a vaccine may also be prepared in the form of a composition containing a vector 
according to the invention, placed under the control of regulatory elements allowing its expression in 
humans or animals. It is possible, for example, to use, as vector for the in vivo expression of the 
10 polypeptide antigen of interest, the plasmid pcDNA3 or the plasmid pcDNAl/neo, both marketed by 
Invitrogen (R&D Systems, Abingdon, United Kingdom). It is also possible to use the plasmid 
VlJns.tPA, described by Shiver et al. in 1995. Such a vaccine will advantageously comprise, in 
addition to the recombinant vector, a saline solution, for example a sodium chloride solution. 

The immunogenic compositions of the invention can be utilized as part of methods of 

15 immunization, wherein such methods comprise administering to a host, e.g., a human host, an 
immunizing amount of the immunogenic compositions of the invention. In a preferred embodiment, 
the method of immunizing is a method of immunizing against Chlamydia trachomatis. 

A pharmaceutical^ acceptable vehicle is understood to designate a compound or a 
combination of compounds entering into a pharmaceutical or vaccine composition which does not 

20 cause side effects and which makes it possible, for example, to facilitate the administration of the 
active compound, to increase its life and/or its efficacy in the body, to increase its solubility in 
solution or alternatively to enhance its preservation. These pharmaceutical^ acceptable vehicles are 
well known and will be adapted by persons skilled in the art according to the nature and the mode of 
administration of the active compound chosen. 

25 As regards the vaccine formulations, these may comprise appropriate immunity 

adjuvants which are known to persons skilled in the art, such as, for example, aluminium hydroxide, a 
representative of the family of muramyl peptides such as one of the peptide derivatives of N-acetyl- 
muramyl, a bacterial lysate, or alternatively incomplete Freund ' s adjuvant, Stimulon™ QS-21 (Aquila 
B ^pharmaceuticals, Inc., Framinham, MA), MPL™ (3-0-deacylated monophosphoryl lipid A; RTBI 

30 ImmunoChem Research, Inc., Hamilton, MT), aluminum phosphate, EL-12 (Genetics Institute, 
Cambridge, MA). 

Preferably, these compounds will be administered by the systemic route, in particular 
by the intravenous route, by the intranasal, intramuscular, intradermal or subcutaneous route, or by the 
oral route. More preferably, the vaccine composition comprising polypeptides according to the 
35 invention will be administered several times, spread out over time, by the intradermal or subcutaneous 
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route. 

Their optimum modes of administration, dosages and galenic forms may be 
determined according to criteria which are generally taken into account in establishing* a treatment 
adapted to a patient, such as for example the patient's age or body weight, the seriousness of his 
5 general condition, tolerance of the treatment and the side effects observed. 

The invention comprises the use of a composition according to the invention for the 
treatment or the prevention of genital diseases which are induced or worsened by Chlamydia 
trachomatis. 

Finally, the invention comprises the use of a composition according to the invention 
10 for the treatment or the prevention of eye diseases which are induced or worsened by the presence of 
Chlamydia trachomatis. 

Finally, the invention comprises the use of a composition according to the invention 
for the treatment or the prevention of systemic diseases, especially of the lymphatic system, which are 
induced or worsened by the presence of Chlamydia trachomatis. 
15 0ther characteristics and advantages of the invention appear in the following 

examples and figures: 



Legend to the figures : 

Figure 1 : Line for the production of Chlamydia trachomatis sequences 
20 Figure 2 : Analysis of the sequences and assembling 
Figure 3 : Finishing techniques 

Figure 3a) : Assembly map 

Figure 3b) : Determination and use of the orphan ends of the contigs 

25 EXAMPLES 
Cells 

The Chlamydia trachomatis LGV2 strain used is identified to have over 98% 
homology with the outer membrane protein sequences ompl (CHTMOMPA) and omp2 
(CHTOMP2A) of the Chlamydia trachomatis serovar L2/434/Bu strain. 
30 The Chlamydia trachomatis LGV2 strain is cultured on mouse fibroblasts (McCoy 

cells), obtained from the American Type Culture Collection, under the reference ATCC CRL-1696. 



35 



Culture of the cells 

The mouse fibroblasts are cultured in 75-ml cell culture flasks (Corning). The culture 
medium is Dulbecco ' s modified cell culture medium (Gibco BRL No. 04101965) supplemented with 
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MEM amino acids (Gibco BRL - No. 04301 140) L (5 ml per 500 ml of medium) and 5% foetal calf 
serum (Gibco BRL No. 10270 batch 40G8260K) without antibiotics or antifungals. 

The cell culture stock is maintained in the following manner. The cell' cultures are 
examined under an inverted microscope. 24 hours after confluence, each cellular lawn is washed with 
5 PBS (Gibco BRL No. 041 14190), rinsed and then placed for 5 min in an oven in the presence of 3 ml 
of trypsine (Gibco BRL No. 25200056). The cellular lawn is then detached and then resuspended in 
120 ml of culture medium, the whole is stirred in order to make the cellular suspension homogeneous. 
30 ml of this suspension are then distributed per cell culture flask. The flasks are kept in a C0 2 oven 
(5%) for 48 hours at a temperature of 37°C. The cell stock is maintained so as to have available daily 
10 16 flasks of subconfluent cells. It is these subconfluent cells which will be used so as to be infected 
with Chlamydia. 25-ml cell culture flasks are also used, these flasks are prepared in a similar manner 
but the volumes used for maintaining the cells are the following: 1 ml of trypsine, 28 ml of culture 
medium to resuspend the cells, 7 ml of culture medium are used per 25-ml flask. 

15 Infection of the cells with Chlamydia 

Initially, the Chlamydiae are obtained frozen (at -70°C), in suspension in a volume of 
1 millilitre. This preparation is slowly thawed, 500 pi are collected and brought into contact with 
subconfluent cells, which are obtained as indicated above, in a 25-ml cell culture flask, containing 
1 ml of medium, so as to cover the cells. The flask is then centrifuged at 2000 rpm in a «swing» rotor 

20 for microtitre plates, the centrifuge being maintained at a temperature of 35°C. After centrifugation, 
the two flasks are placed in an oven at 35°C for three hours. 6 ml of culture medium containing 
cycloheximide (1 ng/ml) are then added and the flask is stored at 35°C. After 48 hours, the level of 
infection is evaluated by direct immunofluorescence and by the cytopathogenic effect caused to the 
cells. 

25 

Direct immunofluorescence 

Starting with infected cells, which were obtained as indicated above, a cellular smear 
is deposited with a Pasteur pipette on a microscope slide. The cellular smear is fixed with acetone for 
10 minutes; after draining the acetone, the smear is covered with 30^1 of murine monoclonal 

30 antibodies directed against MOMP (major outer membrane protein) of Chlamydia (Syva, BiomSrieux) 
labelled with fluorescein isothiocyanate. The whole is then incubated in a humid chamber at a 
temperature of 37°C. The slides are then rinsed with water, slightly dried, and then after depositing a 
drop of mounting medium, a coverslip is mounted before reading. The reading is carried out with the 
aid of a fluorescence microscope equipped with the required filters (excitation at 490 nm, emission at 

35 520 nm). 
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Harvesting of the Chlamydia trachomatis 

After checking the infection by direct immunofluorescence, carried out as indicated 
above, the culture flasks are opened under a sterile cabinet, sterile glass beads with a diameter of the 
5 order of a millimeter are placed in the flask. The flask is closed and then vigorously stirred while 
being maintained horizontally, the cellular lawn at the bottom, so that the glass beads can have a 
mechanical action on the cellular lawn. Most of the cells are thus detached or broken; the effect of the 
stirring is observed under an optical microscope so as to ensure proper release of Chlamydiae. 

10 Large-scale infection of the cell cultures 

The product of the Chlamydiae harvest (culture medium and cellular debris) is 
collected with a pipette, and distributed into three cell culture flasks containing subconfluent L cells, 
obtained as indicated above. The cells thus inoculated are placed under gentle stirring (swing) in an 
oven at 35°C. After one hour, the flasks are kept horizontally in an oven so that the culture medium 

15 covers the cells for 3 hours. 30 ml of culture medium containing actydione (1 fig/ml) are then added to 
each of the flasks. The culture flasks are then stored at 35°C for 48 hours. The cells thus infected are 
examined under an optical microscope after 24 hours, the cytopathogenic effect is evaluated by the 
appearance of cytoplasmic inclusions which are visible under an inverted optical microscope. After 
48 hours, the vacuoles containing the Chlamydiae occupy the cytoplasm of the cell and push the cell 
20 nucleus sideways. At this stage, numerous cells are spontaneously destroyed and have left free 
elementary bodies in the culture medium. The Chlamydiae are harvested as described above and are 
either frozen at -80°C or used for another propagation. 

Purification of the Chlamydiae 

25 The product of the Chlamydia harvests, stored at -80°C, is thawed on a water bath at 

room temperature. After thawing, each tube is vigorously stirred for one minute and immersed for one 
minute in an ultrasound tank (BRANSON 1200); the tubes are then stirred by inverting before being 
centrifuged for 5 min at 2000 rpm. The supernatant is carefully removed and kept at cold temperature 
(ice). The supernatant is vigorously stirred and then filtered on nylon filters having pores of 5 microns 

30 in diameter on a support (Nalgene) allowing a delicate vacuum to be established under the nylon 
filter. For each filtration, three nylon filters are superposed; these filters are replaced after every 40 ml 
of filtrate. Two hundred milliliters of filtration product are kept at cold temperature, and then after 
stirring by inverting, are centrifuged at 10,000 rpm for 90 min, the supernatant is removed and the 
pellet is taken up in 10 ml of 10 mM Tris, vigorously vortexed and then centrifuged at 10,000 rpm for 

35 90 min. The supernatant is removed and the pellet is taken up in a buffer (20 mM Tris pH 8.0, 50 mM 
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KC1, 5 mM MgCl 2 ) to which 800 units of DNAse I (Boehringer) are added. The whole is kept at 37°C 
for one hour. One ml of 0.5 M EDTA is then added, and the whole is vortexed and frozen at -20°C. 

Preparation of the DNA 

The Chlamydiae purified above are thawed and subjected to a proteinase K 
(Boehringer) digestion in a final volume of 10 ml. The digestion conditions are the following: 
0.1 mg/ml proteinase K, 0.1 % SDS at 55°C, stirring every 10 min. The product of digestion is then 
subjected to a double extraction with phenol-chloroform, two volumes of ethanol are added and the 
DNA is directly recovered with a Pasteur pipette having one end in the form of a hook. The DNA is 
dried on the edge of the tube and then resuspended in 500 pi of 2 mM Tris pH 7.5. The DNA is stored 
at 4°C for at least 24 hours before being used for the cloning. 

Cloning of the DNA 

After precipitation, the DNA is quantified by measuring the optical density at 
260 nm. Thirty pg of Chlamydia DNA are distributed into 10 tubes of 1.5 ml and diluted in 300 pi of 
water. Each of the tubes is subjected to 10 applications of ultrasound lasting for 0.5 sec in a sonicator 
(Unisonix XL2020). The contents of the 10 tubes are then grouped and concentrated by successive 
extractions with butanol (Sigma B1888) in the following manner: two volumes of butanol are added 
to the dilute DNA mixture. After stirring, the whole is centrifuged for five minutes at 2500 rpm and 
the butanol is removed. This operation is repeated until the volume of the aqueous phase is less than 
1 ml. The DNA is then precipitated in the presence of ethanol and of 0.5 M sodium acetate pH 5.4, 
and then centrifuged for thirty minutes at 15,000 rpm at cold temperature (4°C). The pellet is washed 
with 75% ethanol, centrifuged for five minutes at 15,000 rpm and dried at room temperature. A tenth 
of the preparation is analysed on a 0.8% agarose gel. Typically, the size of the DNA fragments thus 
prepared is between 200 and 8000 base pairs. 

To allow the cloning of the DNA obtained, the ends are repaired. The DNA is 
distributed in an amount of 10 pg/tube, in the following reaction medium: 100 pi final volume, 1 H 
buffer (Biolabs 201L), 0.5 pi BSA 0.05 mg/ml, 0.1 mM dATP, 0.1 mM each of dGTP, dCTP or 
dTTP, 60,000 IU T4 DNA polymerase. The reaction is incubated for thirty minutes at 16°C. The 
contents of each of the tubes are then grouped before carrying out an extraction with phenol- 
chloroform and then precipitating the aqueous phase as described above. After this step, the DNA 
thus prepared is phosphorylated. For that, the DNA is distributed into tubes in an amount of 10 pg per 
tube, and then in a final volume of 50 pi, the reaction is prepared in the following manner: 1 mM 
ATP, 1 x kinase buffer, 10 IU T4 polynucleotide kinase (Biolabs 201L). The preparation is incubated 
for thirty minutes at 37°C. The contents of the tubes are combined and a phenol-chloroform extraction 
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and then a precipitation are carried out in order to precipitate the DNA. The latter is then suspended in 
1 ill of water and then the DNA fragments are separated according to their size on a 0.8% agarose gel 
(1 x TAE). The DNA is subjected to an electric field of 5 V/cm and then visualized on'a UV table. 
The fragments whose size varies between 1200 and 2000 base pairs are selected by cutting out the gel. 
5 The gel fragment thus isolated is placed in a tube and then the DNA is purified with the Qiaex kit 
(20021 Qiagen), according to the procedure provided by the manufacturer. 

Preparation of the vector 

14 fig of the cloning vector pGEM-5Zf (Promega P2241) are diluted in a final volume 
10 of 150 jil and are subjected to digestion with the restriction enzyme EcoRV 300 IU (Biolabs 195S) 
according to the protocol and with the reagents provided by the manufacturer. The whole is placed at 
37°C for 150 min and then distributed in the wells of a 0.8% agarose gel subjected to an electric field 
of 5 V/cm. The linearized vector is visualized on a UV table, isolated by cutting out the gel and then 
purified by the Qiaex kit (Qiagen 20021) according to the manufacturer's recommendations. The 

15 purification products are grouped in a tube, the volume is measured and then half the volume of 
phenol is added and the whole is vigorously stirred for 1 min. Half the volume of chloroform-isoamyl 
alcohol 24:1 is added and vigorously stirred for 1 min. The whole is centrifiiged at 15,000 rpm for 
5 min at 4°C, the aqueous phase is recovered and transferred into a tube. The DNA is precipitated in 
the presence of 0.3 M sodium acetate, pH 5.4 and 3 volumes of ethanol and placed at -20°C for 

20 1 hour. The DNA is then centrifuged at 15,000 rpm for 30 min at 4°C, the supernatant is removed 
while preserving the pellet, washed twice with 70% ethanol. After drying at room temperature, the 
DNA is suspended in 25 \x\ of water. 

Phosphorylation of the vector 

25 25 Ml of the vector prepared in the preceding step are diluted in a final volume of 

500 |il of the following reaction mixture: 

After repair, the DNA is subjected to a phenol-chloroform extraction and a 
precipitation, the pellet is then taken up in 10^1 of water, the DNA is quantified by measuring the 
optical density at 260 nm. The quantified DNA is ligated into the vector PGEm-5Zf(+) prepared by 

30 the restriction enzyme EcoRV and dephosphorylated (see preparation of the vector). The ligation is 
carried out under three conditions which vary in the ratio between the number of vector molecules and 
the number of insert molecules. Typically, an equimolar ratio, a ratio of 1 :3 and a ratio of 3: 1 are used 
for the ligations which are, moreover, carried out under the following conditions: vector 
PGEm-5Zf(+) 25 ng, cut DNA, ligation buffer in a final volume of 20^1 with T4 DNA Iigase 

35 (Amersham E70042X); the whole is then placed in a refrigerator overnight and then a phenol- 
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chloroform extraction and a precipitation are carried out in a conventional manner. The pellet is taken 
up in 5 ul of water. 

Transformation of the bacteria 
5 Plating of the bacteria 

Petri dishes containing LB Agar medium containing ampicillin (50ug/ml), Xgal 
(280 pg/ml) [5-bromo-4-chloro-indolyl-beta-D-galactopyranoside (Sigma B-4252)], IPTG (140 ug/ml) 
[isopropyl-beta-D-thiogalactoside (Sigma 1-6758)] are used, 50 and 100 ul of bacteria are plated for 
each of the ligations. The Petri dishes are placed upside down at 37°C for 15 to 16 hours in an oven. 
10 The number of «recombinant» positive clones is evaluated by counting the white colonies and the 
blue colonies which are thought to contain the vector alone. 

Evaluation of the «recombinant» positive clones : 

Ninety-four white colonies and two blue colonies are collected with the aid of sterile 
15 cones and are deposited at the bottom of the wells of plates designed for carrying out the 
amplification techniques. 30 pi of the following reaction mixture are added to each well: 1.7 mM 
MgCl 2 , 0.2 mM each of dATP, dCTP, dGTP and dTTP, two synthetic oligonucleotides corresponding 
to sequences flanking the cloning site on either side and orienting the synthesis of the DNA in a 
convergent manner (0.5 nM RP and PU primers, 1 U TAQ polymerase (GibcoBRL 18038-026)). 
20 The colonies thus prepared are subjected to a temperature of 94°C for 5 min and then 

to 30 thermal cycles composed of the following steps: 94°C for 40 s, 50°C for 30 s, 72°C for 180 s. 
The reaction is then kept for 7 min at 72°C and then kept at 4°C. 

The amplification products are deposited on an agarose gel (0.8%), stained with 
ethidium bromide, subjected to electrophoresis, and then analysed on an ultraviolet table. The 
25 presence of an amplification fragment having a size greater than 500 base pairs indicates the presence 
of an insert. The bacterial clones are then prepared so as to study the sequence of their insert. 

Sequencin g 

To sequence the inserts of the clones obtained as above, these were amplified by PGR 
30 on bacteria cultures carried out overnight using the primers for the vectors flanking the inserts. The 
sequence of the ends of these inserts (on average 500 bases on each side) was determined by 
automated fluorescent sequencing on an ABI 377 sequencer, equipped with the ABI Prism DNA 
Sequencing Analysis software (version 2.1.2). 



35 Analysis of the sequences 
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The sequences obtained by sequencing in a high-yield line (Figure 1) are stored in a 
database; this part of the production is independent of any treatment of the sequences. The sequences 
are extracted from the database, avoiding all the regions of inadequate quality, that is to say the 
regions for which uncertainties are observed on the sequence at more than 95%, After extraction, the 
5 sequences are introduced into a processing line, the diagram of which is described in Figure 2. In a 
first path of this processing line, the sequences are assembled by the Gap4 software from R. Staden 
(Bonfield et al, 1995) (OS UNIX/SUN Solaris); the results obtained by this software are kept in the 
form of two files which will be used for a subsequent processing. The first of these files provides 
information on the sequence of each of the contigs obtained. The second file represents all the clones 

10 participating in the composition of all the contigs as well as their positions on the respective contigs. 

The second processing path uses a sequence assembler (TIGR-Asmg assembler 
UNIX/SUN Solaris); the results of this second processing path are kept in the form of a file in the 
TIGR-Asmg format which provides information on the relationship existing between the sequences 
selected for the assembly. This assembler is sometimes incapable of linking contigs whose ends 

15 overlap over several hundreds of base pairs. 

The results obtained from these two assemblers are compared with the aid of the 
BLAST program, each of the contigs derived from one assembly path being compared with the 
contigs derived from the other path. 

For the two processing paths, the strict assembly parameters are fixed (95% 

20 homology, 30 superposition nucleotides). These parameters avoid 3 to 5% of the clones derived from 
eukaryotic cells being confused with sequences obtained from the clones derived from Chlamydia 
trachomatis. The eukaryotic sequences are however preserved during the course of this project; the 
strategy introduced, which is described below, will be designed, inter alia, not to be impeded by these 
sequences derived from contaminating clones. 

25 The results of these two assemblers are processed in a software developed for this 

project. This software operates on a Windows NT platform and receives, as data, the results derived 
from the STADEN software and/or the results derived from the TIGR-Asmg assembler, the software, 
results, after processing of the data, in the determination of an assembly map which gives the 
proximity relationship and the orientation of the contigs in relation to one another (Figure 3a). Using 

30 this assembly map, the software determines all the primers necessary for finishing the project. This 
treatment, which will be detailed below, has the advantage of distinguishing the isolated sequences 
derived from the contaminations, by the DNA eukaryotic cells, of the small-sized sequences clearly 
integrated into the project by the relationships which they establish with contigs. In order to allow, 
without any risk of error, the arrangement and the orientation of the contigs in relation to one another, 

35 a statistical evaluation of the accuracy of the names «naming» of sequence is made from the results of 
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«contigation». This evaluation makes it possible to give each of the clone plates, as well as each of 
the subsets of plates, a weight which is inversely proportional to probable error rate existing in the 
«naming» of the sequences obtained from this plate or from a subset of this plate. In spite of a low 
error rate, errors may occur throughout the steps of production of the clones and of the sequences. 
5 These steps are numerous, repetitive and although most of them are automated, others, like the 
deposition in the sequencers, are manual; it is then possible for the operator to make mistakes such as 
the inversion of two sequences. This type of error has a repercussion on the subsequent processing of 
the data, by resulting in relationships (between the contigs) which do not exist in reality, then in 
attempts at directed sequencing between the contigs which will end in failure. It is because of this that 
10 the evaluation of the naming errors is of particular importance since it allows the establishment of a 
probabilistic assembly map from which it becomes possible to determine all the clones which will 
serve as template to obtain sequences separating two adjacent contigs. Table 2 of parent U.S. 
Application Serial No. 60/107077 filed November 4, 1998, French application 97-15041 filed 
November 28, 1997 and French application 97-16034 filed December 17, 1997, each of which is 
15 incorporated by reference herein in its entirety, gives the clones and the sequences of the primers 
initially used during the initial operations. 

To avoid the step which consists in ordering and then preparing the clones by 
conventional microbiological means, outer and inner primers oriented towards the regions not yet 
sequenced are defined by the software. The primers thus determined make it possible to prepare, by 
20 PCR, a template covering the nonsequenced region. It is the so-called outer primers (the ones most 
distant from the region to be sequenced) which are used to prepare this template. The template is then 
purified and a sequence is obtained on each of the two strands during 2 sequencing reactions which 
each use one of the 2 inner primers. In order to facilitate the use of this approach, the two outer 
primers and the two inner primers are prepared and then stored on the same position of 4 different 
25 96-well plates. The two plates containing the outer primers are used to perform the PCRs which serve 
to prepare the templates. These templates will be purified on purification columns preserving the 
topography of the plates. Each of the sequences are obtained using primers situated on one and then 
on the other of the plates containing the inner primers. This distribution allows a very extensive 
automation of the process and results in a method which is simple to use for finishing the regions not 
30 yet sequenced. Table 3 of parent U.S. Application Serial No. 60/107077 filed November 4, 1998, 
French application 97-15041 filed November 28, 1997 and French application 97-16034 filed 
December 17, 1997, each of which is incorporated by reference herein in its entirety, gives the names 
and the sequences of the primers used for finishing Chlamydia trachomatis. 

Finally, a number of contigs exist in a configuration where one of their ends is not 
35 linked to any other contig end (Figure 3b) by a connecting clone relationship (a connecting clone is 
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defined as a clone having one sequence end on a contig and the other end of its sequence on another 
contig; furthermore, this clone must be derived from a plate or a subset of plates with adequate 
naming quality). For the Chlamydia trachomatis project, this particular case occurred 37 times. Two 
adjacent PCR primers orienting the synthesis of the DNA towards the end of the consensus sequence 
5 are defined for each of the orphan ends of the consensus sequence. The primer which is closest to the 
end of the sequence is called the inner primer whereas the primer which is more distant from the end 
of the sequence is called the outer primer. The outer primers are used to explore the mutual 
relationship between the orphan ends of the different contigs. The presence of a single PCR product 
and the possibility of amplifying this product unambiguously using the inner primers evokes the 
10 probable relationship between the contigs on which the primers which allowed the amplification are 
situated. This relationship will be confirmed by sequencing and will allow the connection between the 
orphan ends of the consensus sequences. This strategy has made it possible to obtain a complete map 
of the Chlamydia trachomatis chromosome and then to finish the project. 

15 Quality control 

AH the bases not determined with certainty in the chromosomal sequence were noted 
and the density of uncertainties was measured on the entire chromosome. The regions with a high 
density of uncertainties were noted and the PCR primers spanning these regions were drawn and are 
represented in Table 4 of parent U.S. Application Serial No. 60/107077 filed November 4, 1998, 

20 French application 97-15041 filed November 28, 1997 and French application 97-16034 filed 
December 17, 1997 each of which is incorporated by reference herein in its entirety. 

Data banks 

Local reorganizations of major public banks were used. The protein bank used 
25 consists of the nonredundant fusion of the Genpept bank (automated translation of GenBank, NCBI; 
Benson etal., 1996). 

The entire BLAST software (public domain, Altschul et al M 1990) for searching for 
homologies between a sequence and protein or nucleic data banks was used. The significance levels 
used depend on the length and the complexity of the region tested as well as the size of the reference 
30 bank. They were adjusted and adapted to each analysis. 

The results of the search for homologies between a sequence according to the 
invention and protein or nucleic data banks are presented and summarized in Table 1 below. 



35 



Table 1 : List of coding chromosome regions and homologies between these regions and the sequence 
banks. 
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Legend to Table 1: Open reading frames are identified with the GenMark software 
version 2.3A (GenePro), the template used is Chlamydia trachomatis of order 4 on a length of 
196 nucleotides with a window of 12 nucleotides and a minimum signal of 0.5. These reading frames 
are numbered in order of appearance on the chromosome, starting with ORF2 (ORF column). The 
5 positions of the beginning and of the end are then given in column 2 (position). When the position of 
the beginning is greater than the position of the end, this means that the region is encoded by the 
strand complementary to the sequence which was given in the sequence SEQ ED No. 1. 

All the putative products were subjected to a search for homology on GENPEPT 
(release 103 for SEQ ID No. 2 to SEQ ID No. 1076 and release 108 for SEQ ID No. 1077 to SEQ ID 

10 No. 1197 with the BLASTp software (Altschul etal. 1990), with, as parameters, the default 
parameters with the exception of the expected value E set at 10* 5 (for SEQ E> No. 2 to SEQ ID No. 
1076) and P value set at e 10 (for SEQ ID No. 1077 to SEQ ID No. 1197). Subsequently, only the 
identities greater than 30% (1% column) were taken into account. The description of the most 
homologous sequence is given in the Homology column; the identifier for the latter sequence is given 

15 in the ID column and the animal species to which this sequence belongs is given in the Species 
column. The Homology score is evaluated by the sum of the blast scores for each region of homology 
and reported in the Score column. Table 1 also reflects data from additional ORF finder programs as 
defined below. 



20 Materials and methods: transmembrane domains : 

The DAS software was used as recommended by the authors (Cserzo et al., 1997). 

This method uses, to predict the transmembrane domains, templates derived from a 
sampling of selected proteins. All the regions for which a «Cutoff» greater than 1.5 was found by the 
program were taken into account. 

25 

Additional ORF Finder Programs 

For this analysis, two additional ORF finder programs were used to predict potential 
open reading frames of a minimum length of 74 amino acids; Glimmer (Salzberg, S.L., Delcher, A., 
Kasif, S., and W. White. 1998. Microbial gene identification using interpolated Markov models. 
30 Nucleic Acids Res. 26:544-548.), and an in-house written program. The in-house program used a 
very simple search algorithm. The analysis required the that the genomic DNA sequence text be in 
the 5' to 3' direction, the genome is circular, and that TAA, TAG, and TGA are stop codons. The 
search parameters were as follows: 

(1) A search for an ORF that started with a GTG codon was performed. If no GTG codons 
35 were found, then a search for an ATG codon was performed. However, if a GTG codon was found, 
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then a search downstream for a ATG codon was performed. All start and stop nucleotide positions 
were recorded. 

(2) A search for an ORF that started with a TTG codon was performed. If no TTG codons 
were found, then a search for a ATG codon was performed. However, if a TTG codon was found, 

5 then a search downstream for a ATG codon was performed. All start and stop nucleotide positions 
were recorded. 

(3) The analysis described in steps 1 and 2 were repeated for the opposite strand of DNA 
sequence. 

(4) A search for ORFs that determined all ORF lengths using start and stop positions in the 
1 0 same reading frames was performed. 

(5) All ORFs whose DNA length was less than 225 nucleotides were eliminated from the 

search. 

Surface Exposed Protein Search Criteria 

15 Potential cell surface vaccine targets arc outer membrane proteins such as porins, 

lipoproteins, adhesions and other non-integral proteins. In Chlamydia psittaci, the major 
immunogens is a group of putative outer membrane proteins (POMPs) and no homologs have been 
found in Chlamydia trachomatis and Chlamydia trachomatis by traditional analysis (Longbottom, D., 
Russell, M., Dunbar, S.M., Jones, G.E., and A.J. Herring. 1998. Molecular Cloning and 

20 Characterization of the Genes Coding for the Highly Immunogenic Cluster of 90-Kilodalton Envelope 
Proteins from Chlamydia psittaci Subtype That Causes Abortion in Sheep. Infect Immun 66:1317- 
1324.) However, utilizing the criteria described below, several ORFs encoding outer membrane 
proteins have been identified in Chlamydia trachomatis, all of which may represent vaccine 
candidates. Any ORF which met any one of the criteria described below were considered to encode a 

25 surface exposed protein. 

Protein homology searches of the translated ORFs were done using the Blastp 2.0 tool 
(Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and D.J. Lipman. 
1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. 
Nucleic Acids Res. 25:3389-3402). An ORF was labeled surface exposed if the translated ORF had 

30 homology to a known, or hypothetical, or putative surface exposed protein with a P score less than 

Most, if not all, proteins that are localized to the membrane of bacteria, via a 
secretory pathway, contain a signal peptide. The software program SignalP, was used to analyze the 
amino acid sequence of an ORF for such a signal peptide (Nielsen, H., Engelbrecht. J., Brunak, S M 
35 and G. von Heijne. 1997. Identification of prokaryotic and eukaryotic signal peptides and prediction 
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of their cleavage sites. Protein Engineering 10:1-6.) The first 60 N-terminal amino acids of each ORF 
were analyzed by SignalP using the Gram-negative software database. The output generates four 
separate values, maximum C, maximum Y, maximum S, and mean S. The S-score, or signal region, is 
the probability of the position belonging to the signal peptide. The C-score, or cleavage site, is the 
5 probability of the position being the first in the mature protein. The Y-score is the geometric average 
of the C-score and a smoothed derivative of the S-score. A conclusion of either a Yes or No is given 
next to each score. If all four conclusions are Yes and the C-terminal amino acid is either a 
phenylalanine (F) or a tyrosine (Y), the ORF was labelled outer membrane (Struyve, M, Moons, M., 
and J. Tommassen. 1991, Carboxy-terminal Phenylalanine is Essential for the Correct Assembly of a 

10 Bacterial Outer Membrane Protein. J. Mol. Biol. 218:141-148.) 

The program called Psort was used to determine the localization of a protein based on 
its signal sequence, recognition of transmembrane segments, and analysis of its amino acid 
composition (Nakai, K., and M. Kanehisa. 1991. Expert system for predicting protein localization 
sites in gram-negative bacteria. Proteins 1 1 :95-l 10.) An ORF is considered to be an outer membrane 

15 protein if the output data predicts the ORF encoded protein as outer membrane with a certainty value 
of 0.5 or better and whose value is at least twice as large as the next predicted localized certainty 
value. 

Finally, ORFs that were not predicted to be outer membrane or surface exposed, 
based on the above criteria, were further analyzed. The Blastp output data for these ORFs were 
20 searched using various general and specific keywords, suggestive of known cell surface exposed 
proteins. An ORF was labeled surface exposed if the keywords matched had a Blastp hit with a P 
score less than e ,0 , and there was no better data indicating otherwise. The following is a list of the 
searched keywords: 



25 


Adhesion 


Adhesin 


Invasin 




Invasion 


Extension 


Omp 




Outer Surface 


Porin 


Outer Membrane 




Cell Surface 


Cell Wall 


Pilin 




Flagellar sheath 


Cir 


ChuA 


30 


CopB 


ExeD 


FadL 




FecA 


FepA 


FhuA 




FmdC 


FomA 


FrpB 




GspD 


HemR 


HgbA 




Hgp 


HmbR 


HmuR 


35 


HMW 


HrcC 


Hrp 
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UlVKJ 


LamB 


LbpA 


LcrQ 


LmpJ 


MxiD 


MUMr 


PilE 


HpaA 


NoIW 


NspA 


OpcP 


5 OpnP 


Opr 


OspA 


rnoti 


PldA 


Por 


rscC 


PulD 


PupA 


QuiX 


RafY 


ScrY 


SepC 


ShuA 


SomA 


10 SpiA 


Tbpl 


Yop 


YscC 


mip 


Tol 


Pilus 


BtuB 





Those ORFs that did not meet the minimum requirement for being an outer membrane protein based 
on the above search criteria but which were homologous to identified outer membrane ORFs in 

15 Chlamydia pneumoniae were included. The Chlamydia pneumoniae genome (French patent 
application No. 97-14673, filed 21 November 1997) was analyzed using the above search criteria and 
a number of outer membrane ORFs were identified. These Chlamydia pneumoniae ORFs were then 
tested against the Chlamydia trachomatis genome using Blastp. Any Chlamydia trachomatis ORF 
with a Blastp P value less than e"'° against a Chlamydia pneumoniae outer membrane was included in 

20 this section, if there was no better data indicating otherwise. A list of ORFs in the Chlamydia 
trachomatis genome encoding putative surface exposed proteins is set forth above in the specification. 



Identification of Putative Li poproteins in the Genome of Chlamydia trachomatis 
Lipoproteins are the most abundant post-translationally modified bacterial secretory 

25 proteins (Pugsley, A. P.. 1993. The complete general secretory pathway in Gram-negative bacteria. 
Microbiol. Rev. 57:50-108). The characteristic features of lipoproteins are a thiol-linked 
diacylglyceride and an amine-linked monoacyl group on the cysteine that becomes the amino-terminal 
residue after signal peptide cleavage by Signal Peptidase H. (Pugsley, A. P.. 1993. The complete 
general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108). The identification 

30 of putative lipoproteins from the genomic sequencing of Chlamydia trachomatis was done by 
examining the deduced amino acid sequence of identified ORFs for the presence of a signal peptide 
with a Signal Peptidase II cleavage site analogous to the consensus sequence for prolipoprotein 
modification and processing reactions (Hayashi, S., and H. C. Wu. 1992. Identification and 
characterization of lipid-modified proteins in bacteria, p. 261-285. In N. M. Hooper and A. J. Turner 

35 (ed.) Lipid modification of proteins: A practical approach. Oxford University Press, New York; 
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Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram-positive bacteria. J. Bacterid. 
177:1123-1128). 

The deduced amino acid sequences of Chlamydia trachomatis ORFs were initially screened 
for the most basic of lipoprotein characteristics, a cysteine in the first 30 amino acids of the deduced 
5 protein. ORFs with a standard start codon (ATG, GTG, or TTG) and having one or more of the 
following characteristics were selected for direct analysis of their first 30 amino acids: 

(a) Significant Signal P value (at least two out-of-the four values are Yes) 

(b) PSORT value indicating membrane passage (IM-inner membrane, Peri-periplasm, or 
OM-outer membrane) 

10 (c) Identification of the word lipoprotein among the ORF Blastp data set. 

(d) A Blastp value of <e* 10 with a putative lipoprotein from Chlamydia pneumoniae 
(French application No. 97-14673 filed 21 November 1997). 

The first 30 amino acids encoded by each ORF in this set were analyzed for the 
characteristics commonly found in lipoprotein signal peptides (Pugsley, A. P.. 1993. The complete 

15 general secretory pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108; Hayashi, S., and H. 
C. Wu. 1992. Identification and characterization of lipid-modified proteins in bacteria, p. 261-285. In 
N. M. Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. Oxford 
University Press, New York; Sutcliffe, I. C. and R. R. B. Russell. 1995. Lipoproteins of Gram- 
positive bacteria. J. Bacterid. 177:1123-1128.) Putative lipoprotein signal peptides were required to 

20 have a cysteine between amino acid 10 and 30 and reach a minimum score of three based on the 
following criteria for lipoprotein signal peptides: 

(a) Identification of specific amino acids in specific positions around the cysteine which are 
part of the consensus Signal Peptidase II cleavage site (Hayashi, S., and H. C. Wu. 1992. 
Identification and characterization of lipid-modified proteins in bacteria, p. 261-285. In 

25 N - M - Hooper and A. J. Turner (ed.) Lipid modification of proteins: A practical approach. 

Oxford University Press, New York); Sutcliffe, I. C. and R. R. B. Russell. 1995. 
Lipoproteins of Gram-positive bacteria. J. Bacteriol. 177:1123-1128). Since the 
identification of the cleavage site is the most important factor in identifying putative 
lipoproteins, each correctly positioned amino acid contributed toward reaching the 

30 minimum score of three. 

(b) A hydrophobic region rich in alanine and leucine prior to the cleavage site (Pugsley, A. 
P.. 1993. The complete general secretory pathway in Gram-negative bacteria. Microbiol. 
Rev. 57:50-108) contributed toward reaching the minimum score of three. 

(c) A short stretch of hydrophilic amino acids greater than or equal to 1, usually lysine or 
35 arginine, following the N-terminal methionine (Pugsley, A. P.. 1993. The complete general secretory 
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pathway in Gram-negative bacteria. Microbiol. Rev. 57:50-108) contributed toward reaching the 
minimum score of three. 

A list of ORFs in the Chlamydia trachomatis genome encoding putative lipoproteins is set forth above 
in the specification. 

5 

LPS-Related ORFs of Chlamydia trachomatis 

Lipopolysaccharide (LPS) is an important major surface antigen of Chlamydia cells. 
Monoclonal antibodies (Mab) directed against LPS of Chlamydia pneumoniae have been identified 
that can neutralize the infectivity of Chlamydia pneumoniae both in vitro and in vivo (Peterson et al. 

10 1988). Similar results are expected utilizing monoclonal antibodies against LPS of Chlamydia 
trachomatis. LPS is composed of lipid A and a core oligosaccharide portion and is phenotypically of 
the rough type (R-LPS) (Lukacova, M., Baumann, M M Brade, L., Mamat, U. t Brade, H. 1994. 
Lipopolysaccharide Smooth-Rough Phase Variation in Bacteria of the Genus Chlamydia. Infect. 
Immun. June 62(6):2270-2276.) The lipid A component is composed of fatty acids which serve to 

15 anchor LPS in the outer membrane. The core component contains sugars and sugar derivatives such 
as a trisaccharide of 3-deoxy-D-manno-octuIosonic acid (KDO) (Reeves, P.R., Hobbs, M., Valvano, 
M.A., Skurnik, M., Whitfield, C, Coplin, D., Kido, N., Klena, J., Maskell, D., Raetz, C.R.H., Rick, 
P.D. 1996. Bacterial Polysaccharide Synthesis and Gene Nomenclature pp. 10071-10078, Elsevier 
Science Ltd.). The KDO gene product is a multifunctional glycosyltransferase and represents a 

20 shared epitope among the Chlamydia. For a review of LPS biosynthesis see, e.g., Schnaitman, C.A., 
Klena, J.D. 1993. Genetics of Lipopolysaccharide Biosynthesis in Enteric Bacteria. Microbiol. Rev. 
57:655-682. 

A text search of the ORF Blastp results identified several genes that are involved in 
Chlamydial LPS production with a P score less than e* 10 . The following key-terms were used in the 
25 text search: KDO, CPS (Capsular Polysaccharide Biosynthesis), capsule, LPS, rfa f rfb, rfc, rfe, rha, 
rhl, core, epimerase, isomerase, transferase, pyrophosphorylase, phosphatase, aldolase, heptose, 
manno, glucose, lpxB, fibronectin, fibrinogen, fucosyltransferase , lie, Igt, pgm, tolC, rol, ChoP, 
phosphorylcholine, waaF, PGL-Tbl. A list of ORFs in the Chlamydia trachomatis genome encoding 
putative polypeptides involved in LPS biosynthesis is set forth above in the specification. 

30 

Type HI And Other Secreted Products 

Type in secretion enables gram-negative bacteria to secrete and inject pathogenicity 
proteins into the cytosol of eukaryotic host cells (Hueck, C. J., 1998. Type HI Protein Secretion 
Systems in Bacterial Pathogens of Animals and Plants. In Microbiology and Molecular Biology 
35 Reviews. 62:379-433.) These secreted factors often resemble eukaryotic signal transduction factors, 
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thus enabling the bacterium to redirect host cell functions (Lee, C.A., 1997. Type HI secretion 
systems: machines to deliver bacterial proteins into eukaryotic cells? Trends Microbiol. 5:148-156.) 
In an attempt to corrupt normal cellular functions, Chlamydial pathogenicity factors injected into the 
host cytosol will nonetheless, as cytoplasmic constituents be processed and presented in the context of 
5 the Major Histocompatibility Complex (MHC class I). As such, these pathogenicity proteins 
represent MHC class I antigens and will play an important role in cellular immunity. Also included in 
this set are secreted non-type III products that may play a role as vaccine components. 

A text search of the ORF Blastp results identified genes that are involved in 
Chlamydia trachomatis protein secretion with a P score less than e" 10 . The following key-terms were 

10 used in the text search in an effort to identify surface localized or secreted products: Yop, Lcr, Ypk, 
Exo, Per, Pop, Ipa, Vir, Ssp, Spt, Esp, Tir, Hrp, Mxi, hemolysin, toxin, IgA protease, cytolysin, tox, 
hap, secreted and Mip. 

Chlamydia trachomatis ORFs that did not meet the above keyword search criteria, 
but have homologs in Chlamydia pneumoniae that do meet the search criteria are included herein. 

15 The Chlamydia pneumoniae genome (French patent application No. 97-14673, filed 21 November 
1997) was analyzed using the above search criteria and a number of ORFs were identified. These 
Chlamydia pneumoniae ORFs were tested against the Chlamydia trachomatis genome using Blastp. 
Any Chlamydia trachomatis ORF with a Blastp P value < e 10 against a Chlamydia pneumoniae 
homolog, identified using the above search criteria, was included. A list of ORFs in the Chlamydia 

20 trachomatis genome encoding putative secreted proteins is set forth above in the specification. 

Chlamydia trachomatis RGD Recognition Sequence 

Proteins that contain Arg-GIy-Asp (RGD) attachment site, together with integrins that 
serve as their receptor constitute a major recognition system for cell adhesion. The RGD sequence is 
25 the cell attachment site of a large number of adhesive extracellular matrix, blood, and cell surface 
proteins and nearly half of the known integrins recognize this sequence in their adhesion protein 
ligands. There are many RGD containing microbial proteins such as the penton protein of adenovirus, 
the coxsackie virus, the foot and mouth virus and pertactin, a 69 kDa (kilodalton) surface protein of 
Bordetella pertussis, that serve as ligands through which these microbes bind to integrins on the cell 
30 surfaces and gain entry into the cell. The following provides evidence supporting the importance of 
RGD in microbial adhesion: 

a) The adenovirus penton base protein has a cell rounding activity and when penton base was 
. expressed in E. coli, it caused cell rounding and cells adhered to polystyrene wells coated with the 
protein. Mutant analysis showed that both these properties required an RGD sequence. Virus 
35 mutants with amino acid substitutions in the RGD sequence, showed much less adherence to HeLa S3 
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cells, and also were delayed in virus reproduction (Bai, M. f Harfe, B., and Freimuth, P. 1993. 
Mutations That Alter an RGD Sequence in the Adenovirus Type 2 Penton Base Protein Abolish Its 
Cell-Rounding Activity and Delay Virus Reproduction in Flat Cells. J. Virol. 67:5198-5205). 

5 b) It has been shown that attachment and entry of coxsackie virus A9 to GMK cells were 

dependent on an RGD motif in the capsid protein VP1. VP1 has also been shown to bind a v p3 
integrin, which is a vitronectin receptor (Roivainen, M, Piirainen, L., Hovi, T., Virtanen, L, Riikonen, 
T., Heino, J., and Hyypia, T. 1994. Entry of Coxsackievirus A9 into Host Cells: Specific Interactions 
with a v p? Integrin, the Vitronectin Receptor Virology, 203:357-65). 

10 

c) During the course of whooping cough, Bordetella pertussis interacts with alveolar 
macrophages and other leukocytes on the respiratory epithelium. Whole bacteria adheres by means 
of two proteins, filamentous hemagglutinin (FHA) and pertussis toxin. FHA interacts with two 
classes of molecules on macrophages, galactose containing glycoconjugates and the integrin CR3. 

15 The interaction between CR3 and FHA involves recognition of RGD sequence at the positions 1097- 
1099 in FHA (Relman, D., Tuomanen, E., Falkow, S., Golenbock, D. T., Saukkonen, K., and Wright, 
S. D. u Recognitition of a Bacterial Adhesin by an Integrin: Macrophage CR3 Binds Filamentous 
Hemagglutinin of Bordetella Pertussis." Cell, 61:1375-1382 (1990)). 

d) Pertactin, a 69 kDa outer membrane protein of Bordetella pertussis, has been shown to 
promote attachment of Chinese hamster ovary cells (CHO). This attachment is mediated by 
recognition of RGD sequence in pertactin by integrins on CHO cells and can be inhibited by synthetic 
RGD containing peptide homologous to the one present in pertactin (Leininger, E., Roberts, M„ 
Kenimer, J. G., Charles, I. G„ Fairweather, N., Novotny, P., and Brennan, M. J. 1991. Pertactin, an 
Arg-Gly-Asp containing Bordetella pertussis surface protein that promotes adherence of mammalian 
cells Proc. Natl. Acad. Sci. USA, 88:345-349). 

e) The RGD sequence is highly conserved in the VP1 protein of foot and mouth disease virus 
(FMDV). Attachment of FMDV to baby hamster kidney cells (BHK) has been shown to be mediated 
by VP1 protein via the RGD sequence. Antibodies against the RGD sequence of VP1 blocked 
attachment of virus to BHK cells (Fox, G., Parry, N. R., Baniett, P. V M McGinn, B., Rowland, D. J., 
and Brown, F. 1989. The Cell Attachment Site on Foot-and-Mouth Disease Virus Includes the Amino 
Acid Sequence RGD (Arginine-Glycine-Aspartic Acid) J. Gen. Virol., 70:625-637). 

It has been demonstrated that bacterial adherence can be based on interaction of a 
bacterial adhesin RGD sequence with an integrin and that bacterial adhesins can have multiple 
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binding site characteristic of eukaryotic extracellular matrix proteins. RGD recognition is one of the 
important mechanisms used by microbes to gain entry into eukaryotic cells. 

The complete deduced protein sequence of the Chlamydia trachomatis "genome was 
searched for the presence of RGD sequence. There were a total of 38 ORFs that had one or more 
5 RGD sequences. Not all RGD containing proteins mediate cell attachment. It has been shown that 
RGD containing peptides that have proline immediately following the RGD sequence are inactive in 
cell attachment assays (Pierschbacher & Ruoslahti. 1987. Influence of stereochemistry of the 
sequence Arg-Gly-Asp-Xaa on binding specificity in cell adhesion. J. Biol. Chem. 262:17294-98). 
ORFs that had RGD, with proline as the amino acid following the RGD sequence were excluded from 
10 the list. Also, RGD sequence may not be available at the surface of the protein or may be present in a 
context that is not compatible with integrin binding. Since not all RGD-containing proteins are 
involved in cell attachment, several other criteria were used to refine the list of RGD-containing 
proteins. A list of ORFs in the Chlamydia trachomatis genome encoding polypeptides with RGD 
recognition sequence(s) is set forth above in the specification. 

15 

Non-Chlamvdia pneumoniae ORFs 

Chlamydia trachomatis ORFs were compared to the ORFs in the Chlamydia 
pneumoniae genome (French patent application No. 97-14673, filed 21 November 1997) using Blastp. 
Any Chlamydia trachomatis ORF with a Blastp «?» value greater than e" 10 (i.e. >e" 10 ) against 
20 Chlamydia pneumoniae ORFs are included in this section. A list of ORFs in the Chlamydia 
trachomatis genome which are not found in Chlamydia pneumoniae is set forth above in the 
specification. 

Cell Wall Anchor Surface ORFs 

25 Many surface proteins are anchored to the cell wall of Gram-positive bacteria via the 

conserved LPXTG motif (Schneewind, O., Fowler, A., and Faull, K.F. 1995. Structure of the Cell 
Wall Anchor of Surface Proteins in Staphylococcus aureus. Science 268:103-106). A search of the 
proteins encoded by the Chlamydia trachomatis ORFs was done using the motif LPXTG. A list of 
ORFs in the Chlamydia trachomatis genome encoding polypeptides anchored to the cell wall is set 

30 forth above in the specification. 

ECACC Deposits 

Samples of Chlamydia trachomatis were deposited with the European Collection of 
Cell Cultures (ECACC), Salisbury, Wiltshire SP4 OJG, UK on November 26, 1998 and assigned the 
35 provisional accession number 981 12618. Cells can be grown, harvested and purified, and DNA can 
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be prepared as discussed above. In order to enable recovery of specific fragments of the chromosome, 
one can run targeted PCR reactions, whose amplification products can then be sequenced and/or 
cloned into any suitable vector, according to standard procedures known to those skilled itf the art. 

In addition, a pool of clones covering the Chlamydia trachomatis genome was 
5 deposited with the ECACC on November 26, 1998 and assigned provisional accession number 
98112617. The pool of clones contains a series of clones, which when taken together, cover the 
whole chromosome, with a redundancy of slightly more than ten. The total number of clones in the 
sample is 13,572. 

Table 4 lists groups of oligonucleotides to be used to amplify each of ORFs 2-1197 

10 according to standard procedures known to those skilled in the art. Such oligonucleotides are listed as 
SEQ ID Nos. 1 198 to 5981. For each ORF, the following is listed: one forward primer positioned 
2,000 bp upstream of the beginning of the ORF; one forward primer positioned 200 bp upstream of 
the beginning of the ORF; one reverse primer positioned 2,000 bp downstream at the end of ORF, 
which is 2,000 bp upstream of the end site of the ORF on the complementary strand; and one reverse 

15 primer 200 bp downstream at the end of ORF, which is 200 bp upstream of the end site of the ORF on 
the complementary strand. The corresponding SEQ ID Nos. for the primers are listed in Table 4, 
where Fp is the proximal forward primer; Fd is the distal forward primer; Bp is the proximal reverse 
primer; and Bd is the distal reverse primer. The positions of the 5' ends of each of these primers on 
the nucleotide sequence of SEQ ID No. 1 are shown in Table 5. 

20 The present invention is not to be limited in scope by the specific embodiments 

described herein, which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. Indeed, 
various modifications of the invention, in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description and accompanying drawings. Such 

25 modifications are intended to fall within the scope of the appended claims. 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent application 
was specifically and individually indicated to be incorporated by reference. 
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683164 


AQA 


Oo4 / /4 


AO A A 1 0 

684418 


684639 


OI» 1 


/CO/1 QOQ 

0o4ojy 


/CO/COA7. 

686203 


684839 


oyz 


oooiy / 


687204 


686203 


U7J 


Do / j41 


/C O O O ZT A 

688360 


687341 


AQA 

oy4 


056432 


688193 


688426 


oyj 


ooVolo 


688432 


689601 




oeyyou 


689631 


689939 


£Q7 

oy i 


oyi)4o / 


689846 


690445 


AO© 


/COAT 1 *7 

690717 


690463 


690717 


£QQ 

oyy 


/CO 1 071 

oy 1 o / I 


iCAAiCT* 

690672 


691856 


7 An 


ZCAIOT.^? 

693837 


692041 


693837 


7A1 


oy4y34 


693837 


694934 


7A7 


oy /zoj 


/CO /I O/f 1 

694942 


697230 


7A^ 


oyoU54 


/TO*71 *7A 

69/170 


697958 


ICiA 


oyojyz 


iCA*7A*70 

6y /9 /y 


698380 




oyo/yz 


*7AA1 1 *7 

7001 17 


698792 


7A£ 
/UO 


7AAO/CO 

/00269 


700895 


700269 


7A7 


700912 


702165 


700990 


7AC 
/Uo 


/02183 


7034 12 


702183 


7AQ 

/uy 


/U3522 


705000 


703531 


710 


7050 I l 




7A^AA7 


711 


706159 


705704 


706093 


712 


706521 


706138 


706488 


713 


708103 


706496 


707932 
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714 


708398 


708078 


708392 


715 


708610 


708248 


708610 


716 


710278 


708872 


710203 


717 


711164 


710262 


711164 


718 


711432 


712763 


711432 


719 


712767 


713438 


712773 


720 


714232 


713651 


714217 


721 


714632 


714120 


714617 


722 


715592 


714834 


715739 


723 


715854 


715558 


715854 


724 


716937 


715921 


716886 


725 


718357 


717149 


718357 


726 


718500 


718862 


718590 


727 


719797 


718499 


719776 


728 


720273 


719782 


720147 


729 


720452 


720144 


720452 


730 


720613 


721575 


720613 


731 


721559 


722356 


721571 


732 


723248 


722397 


723239 


733 


724598 


723378 


724580 


734 


725763 


724576 


725760 


735 


726519 


725767 


726519 


736 


726819 


726538 


726801 


737 


727493 


726753 


727466 


738 


727984 


727469 


727984 


739 


728778 


728329 


728718 


740 


729346 


728759 


729334 


741 


732639 


729442 


732639 


742 


733246 


734427 


733246 


743 


734814 


735659 


734814 


*7AA 
f*t*f 


/J5644 


736504 


735644 


745 


736520 


737254 


736520 


746 


737254 


737787 


737254 


747 


737942 


738679 


738122 
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■ :.prefen^ start 


748 


738838 


739740 


738862 


749 


742057 


740060 


741982 


750 


742869 


742045 


742824 


751 


743378 


742824 


743348 


752 


744298 


743306 


744292 


753 


744714 


744430 


744660 


754 


744985 


744611 


744931 


755 


745557 


744958 


745548 


756 


746412 


745561 


746409 


757 


746772 


746416 


746697 


758 


748269 


746944 


748269 


759 


748966 


748274 


748954 


760 


749426 


748965 


749411 


761 


749702 


749433 


749681 


762 


750029 


749721 


750020 


763 


752307 


750007 


752307 


764 


752913 


752503 


752901 


765 


754659 


753616 


754659 


766 


755000 


756814 


755000 


767 


756796 


758301 


756832 


768 


758691 


758446 


758688 


769 


759787 


759338 


759787 


770 


760242 


759871 


760188 


771 


760538 


760188 


760529 


772 


760966 


761772 


760966 


773 


761759 


762142 


761759 


774 


762267 


762983 


762267 


775 


764465 


763335 


764312 


776 


764857 


764438 


764821 


777 


766068 


764821 


765972 


778 


766643 


766065 


766643 


779 


768091 


766934 


768091 


780 


768785 


768252 


768785 


781 


770092 


768791 


770062 
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7R? 


7701 1R 

/ /VI JO 


/ /U4/U 


77A ten 


7R1 


77nfifi 1 


77A1 CC. 


7*7A/C*J 1 

77063 1 


7R4 


77nQ74 


770A1/1 


"7"7AOA >* 

770894 


7RS 


7770 in 


771 710 
/ / 1 jjU 


777A1 A 


7Rfi 


/ /Zjyu 


771101 
/ / Jjy 1 


"7*711 OA 

772390 


7R7 

tot 


774771 


771477 
/ / J4Z / 


774215 


7RR 


77^n^S 
/ /OUj J 


77/1 1Q] 

/ /4ly 1 


*7*7iTAO C 

776035 


7SQ 


/ /000J 


7777A/T 

777700 


776894 


7 on 
/ yu 


7771 OS 

/ / / iyj 


77/CAC1 

/ /oyj j 


777177 


701 


77Q797 
/ /yZZZ 


777717 
/ / / / jZ 


*7*7 A 1 OA 

779180 


709 


77Q171 
/ fyjz. 1 


/oi jjZ 


779360 


703 

/ 2* j 


7R1 7Q7 


7ff 7/M7 

/oZ44Z 


781351 


704 


7R7A47 


/ojjZ4 


*701>1 AH 

7o2A47 


70S 


/ojj jZ 


7C/CAA1 

/oOUUz 


785697 


7QA 
/yo 


/oOjoU 


70C</1^ 

7o5546 


to f c r>s\ 

786580 


707 
/y / 


7077 71 1 

/O / /41 


/066i 1 


787729 


70S 
/y o 


707/COA 
/5/OZU 


/ooUZl 


TOTTOT 

787782 


700 
/ yy 




7C7Q7A 

/o/yZU 


790064 


ROD 

0\J\J 


7on 1 ff\ 


7QA/TAQ 

/yuouy 


taai to 

790178 


R01 


7onAi4 


/yZU 10 


790634 


R07 


tyj\JOH 


/9Z059 


no vt 

793084 




/yJ34j 


794056 


793370 




7Qyin J 4 A 

/ y*fU4o 


794957 


T A A ATA 

794079 


ROS 




/9j144 


795395 




f 7J J / J 


/yozjj 


795575 


R07 


7Q677R 
/yOZ /o 


/y /ui j 


T A/TO 1 1 

7963 1 1 


ROR 


7Q£Q7Q 

/yoy /y 


797365 


796979 




7077/CA 

/y /zou 


797856 


TAT^rtff 

797395 




/9777Z 


798086 


797805 


Rl 1 
oil 


/yo4Zo 


797935 


T AO 1 A*S 

798393 


812 


798925 


70R41n" 
/ ^ o*t 1 u 


/yoy 10 


813 


799301 


799927 


799301 


814 


800892 


800029 


800892 


815 


801062 


802129 


801062 
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: prefencd start 


816 


802023 


802673 


802041 


817 


802851 


803246 


802920 


818 


803105 


804220 


803111 


819 


804307 


805356 


804331 


820 


805290 


806282 


805356 


821 


806453 


808081 


806498 


822 


808026 


809009 


808098 


823 


810461 


809079 


810437 


824 


811605 


810328 


811590 


825 


811725 


812342 


811824 


826 


812329 


813522 


812398 


827 


813455 


813772 


813455 


828 


813732 


814334 


813780 


829 


815213 


814314 


815207 


830 


814878 


814396 


814975 


831 


815733 


815428 


815733 


832 


816116 


817456 


816170 


833 


817608 


819320 


817608 


834 


819324 


819713 


819342 


835 


819704 


820402 


819713 


836 


820375 


821061 


820453 


837 


821043 


821537 


821043 


838 


821646 


822239 


821667 


839 


822182 


822931 


822221 


840 


824355 


823045 


824352 


841 


825894 


824359 


825891 


842 


826322 


825879 


826322 


843 


826340 


827026 


826340 


844 


827014 


827250 


827014 


845 


827856 


827230 


827856 


846 


828007 


829275 


828025 


847 


829355 


830953 


829358 


848 


831119 


831748 


831140 


849 


8321S2 


831751 


832140 
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pr<rfeire<i : ^art 


850 


832744 


832214 


832666 


851 


833446 


832805 


833446 


852 


833802 


833368 


833742 


853 


834679 


833879 


834661 


854 


835452 


834661 


835365 


855 


835778 


835371 


835775 


856 


836482 


835775 


836470 


857 


836602 


837264 


836617 


858 


837209 


838699 


837209 


859 


838760 


839575 


838760 


860 


839942 


840583 


839951 


861 


840445 


841713 


840451 


862 


841659 


842459 


841686 


863 


842523 


843068 


842541 


864 


843495 


843031 


843447 


865 


843239 


846196 


843335 


866 


844137 


843802 


844077 


867 


848043 


846217 


848022 


868 


850123 


848150 


850099 


869 


851645 


850230 


851504 


870 


853696 


851669 


853672 


871 


854836 


853700 


854809 


872 


855525 


854920 


855468 


873 


856240 


855437 


856240 


874 


857183 


856233 


857006 


875 


859439 


857451 


859430 


876 


859946 


859587 


859916 


877 


859642 


860640 


859660 


878 


861599 


860724 


861599 


879 


862053 


861580 


862038 


880 


863540 


862098 


863531 


881 


863930 


863571 


863927 


882 


864697 


863996 


864688 


883 


864923 


866248 


864923 
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8S4 


866303 


866605 


866336 


885 


866665 


867732 


866665 


886 


867810 


869090 


867864 


887 


869094 


869357 


869094 


888 


869270 


871372 


869336 


889 


871299 


872582 


871359 


890 


872429 


872860 


872555 


891 


872773 


873915 


872773 


892 


873812 


873360 


873668 


893 


874028 


874438 


874067 


894 


874778 


875386 


874796 


895 


875774 


876382 


875843 


896 


877872 


877000 


877866 


897 


878172 


877876 


878157 


898 


879098 


878172 


879098 


899 


878883 


879161 


878886 


900 


879842 


879105 


879809 


901 


880885 


880052 


880885 


902 


881863 


880889 


881863 


903 


882904 


881948 


882901 


904 


883794 


882901 


883761 


905 


884296 


883661 


884296 


906 


884996 


884508 


884984 


907 


888777 


885166 


888771 


908 


890172 


888940 


890172 


909 


891164 


890325 


891146 


910 


891463 


891116 


891427 


911 


893278 


891968 


893278 


912 


893356 


893808 


893386 


913 


893909 


893643 


893894 


914 


894276 


893821 


894276 


915 


894778 


894248 


894760 


916 


895892 


895050 


895874 


917 


895951 


896829 


895963 
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918 


900783 


897064 


900774 


919 


902032 


900791 


902158 


920 


902659 


903876 


902659 


921 


903731 


903471 


903731 


922 


903860 


905605 


903860 


923 


905725 


906474 


905725 


924 


906493 


906945 


906493 


925 


907306 


907001 


907306 


926 


908101 


908742 


908131 


927 


908721 


909194 


908724 


928 


909198 


909584 


909201 


929 


909583 


909951 


909670 


930 


910081 


910569 


910090 


931 


910615 


910944 


910636 


932 


910948 


912261 


910951 


933 


912399 


912629 


912399 


934 


912595 


913218 


912595 


935 


913203 


913676 


913218 


936 


913691 


914485 


913691 


937 


914516 


915136 


914522 


938 


915144 


915467 


915162 


939 


915629 


916633 


915629 


940 


916051 


916539 


916159 


941 


916965 


917627 


916965 


942 


917612 


918304 


917612 


943 


918323 


918655 


918323 


944 


918682 


919533 


918682 


945 


919542 


919829 


919542 


946 


919723 


920157 


919723 


947 


920184 


920840 


920184 


948 


920866 


921294 


920866 


949 


921272 


921514 


921272 


950 


921510 


921758 


921510 


951 


921778 


922143 


921778 



WO 99/28475 



PCT/IB98/01939 



SEQ.IDN© 


; =^0k^^ 




prefetre&start 


952 


922159 


922491 


922159 


953 


922496 


923035 


922496 


954 


923160 


923453 


923160 


955 


923484 


924032 


923484 


956 


924048 


924425 


924057 


957 


924443 


924937 


924443 


958 


924933 


925364 


924933 


959 


925390 


926760 


925390 


960 


926819 


927184 


926819 


961 


927209 


927604 


927209 


962 


927577 


928155 


927577 


963 


928100 


928759 


928127 


964 


929222 


930244 


929243 


965 


930222 


930656 


930258 


966 


930608 


931078 


930665 


967 


931367 


931666 


931406 


968 


931549 


931959 


931558 


969 


932070 


932579 


932070 


970 


932602 


933201 


932602 


971 


933319 


933621 


933319 


972 


933522 


933785 


933522 


973 


934546 


933848 


934546 


974 


936377 


934539 


936377 


975 


938081 


936666 


938081 


976 


938538 


939098 


938595 


977 


939329 


940933 


939506 


978 


941031 


942068 


941076 


979 


942082 


944685 


942082 


980 


944634 


945287 


944673 


981 


945287 


946294 


945287 


982 


946293 


946676 


946368 


983 


947105 


948454 


947132 


984 


948522 


949277 


948546 


985 


949277 


949594 


949277 
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986 


949849 


950676 


949888 


987 


950680 


951330 


950701 


988 


951281 


951643 


951290 


989 


951788 


952798 


951803 


990 


953581 


954264 


953602 


991 


954426 


955157 


954429 


992 


955754 


957940 


955766 


993 


957837 


959312 


957867 


994 


959299 


961050 


959317 


995 


961562 


961053 


961562 


996 


962575 


961487 


962545 


997 


961979 


961584 


961979 


998 


964914 


962545 


964914 


999 


964941 


965708 


964956 


1000 


967023 


966193 


966984 


1001 


967444 


968061 


967459 


1002 


968903 


968064 


968792 


1003 


970685 


969528 


970685 


1004 


971806 


971024 


971785 


1005 


973053 


972388 


973026 


1006 


974546 


973746 


974546 


1007 


975223 


974558 


975214 


1008 


976193 


975207 


976193 


1009 


976520 


976254 


976511 


1010 


976588 


976899 


976588 


1011 


976886 


977635 


976934 


1012 


977661 


977933 


977682 


1013 


977918 


978433 


977933 


1014 


978619 


978984 


978619 


1015 


978933 


979331 


978987 


1016 


981197 


979389 


981197 


1017 


979711 


980112 


979753 


1018 


982116 


981148 


982107 


1019 


982321 


983598 


982321 
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1020 


984488 


983862 


984296 


1021 


985381 


984371 


985381 


1022 


986103 


985399 


986046 


1023 


986693 


986046 


986693 


1024 


987607 


986693 


987607 


1025 


988119 


987616 


987942 


1026 


988253 


987936 


988247 


1027 


988831 


989163 


988834 


1028 


989693 


993442 


989693 


1029 


993408 


993785 


993408 


1030 


993835 


993416 


993754 


1031 


993882 


994262 


993906 


1032 


994226 


995656 


994259 


1033 


996036 


996611 


996036 


1034 


996885 


998267 


996885 


1035 


998962 


999225 


998962 


1036 


999375 


1001033 


999393 


1037 


1001211 


1001516 


1001214 


1038 


1001392 


1001664 


1001443 


1039 


1003721 


1001823 


1003721 


1040 


1004459 


1004845 


1004459 


1041 


1004990 


1005382 


1004990 


1042 


1005391 


1007496 


1005391 


1043 


1007486 


1007821 


1007453 


1044 


1007802 


1008698 


1007841 


1045 


1009426 


1009121 


1009426 


1046 


1010534 


1012054 


1010534 


1047 


1012397 


1011942 


1012241 


1048 


1012042 


1012635 


1012057 


1049 


1012593 


1012862 


1012593 
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1050 


1012811 


1013440 


1012829 


1051 


1013456 


1014055 


1013468 


1052 


1013977 


1014489 


1013977 


1053 


1015224 


1014529 


1015206 
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1054 


1016002 


1015145 


1015963 


1055 


1017120 


1015939 


1017120 


1056 


1017766 


1017245 


1017658 


1057 


1018911 


1017916 


1018893 


1058 


1019191 


1018580 


1019110 


1059 


1 AAA 1 A A 

1020199 


1019831 


1020196 


1060 


1021007 


1020114 


1020992 


1061 


1 A A 1 ^ ✓"A 

1021569 


1021075 


1021557 


1062 


« AAA J 1 4 

1022411 


1022097 


1022402 


1063 


1023344 


1023667 


1023344 


1064 


1023701 


1023949 


1023701 


1065 


1 AA O r\T 

1023976 


1024776 


1023976 


1066 


1 A^ Jin 4 

1024704 


1025045 


1024704 


1067 


1 AA f A A i 

1025881 


1024967 


1025845 


1 A /T> 

1068 


1026546 


1025839 


1026546 


1069 


1027379 


1026546 


1027373 


1070 


1030604 


1027929 


1030328 


1071 


1033252 


1030508 


1033249 


1072 


1031733 


1032086 


1031823 


1073 


1037037 


1033456 


1037016 


1074 


1035674 


1035910 
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AQA 
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992 


3179 


3178 


5571 


5570 


993 


3181 


3180 


5573 


5572 


994 


3183 


3182 


5575 
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998 


3191 


3190 
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3197 


3196 
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3198 
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3200 
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5596 


1006 


3207 


3206 
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1000 


3211 
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5609 


5608 
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3341 


3340 
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3356 


5749 


5748 


1 OR7 


11CQ 
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3362 


j / j j 
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3366 
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3368 
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5760 



WO 99/28475 



224 



PCT/IB98/01939 













1 ACQ 


on 1 

3371 


3370 


5763 
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3373 
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5765 
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ioyz 


3379 
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3410 
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5805 
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1111 
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5809 


5808 
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3447 
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5845 
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3454 
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3457 


3456 


5849 
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1132 


3459 
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5850 
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1134 


3463 
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5857 
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B 
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r 
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F 
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B 
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F 
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B 
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F 
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B 
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B 
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F 
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F 
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F 
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position 
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position 
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1230 


F 


15903 




2825 


F 


798866 




4420 


B 


411645 
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r 
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2826 


F 
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B 
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t 
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F 
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B 
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r 
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F 
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B 
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B 
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r 
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B 
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F 
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F 
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F 
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B 
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r 
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F 
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B 
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2835 


F 
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B 
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F 
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F 
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4431 


B 


416499 
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F 
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F 
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B 
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F 
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F 
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B 
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1244 


F 


11 OAA 
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F 
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B 


417700 
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F 
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F 
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B 
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r 
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2841 


F 
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B 
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r 
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F 
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B 
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T? 

F 
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2843 


F 
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B 


418823 
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F 
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F 
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4439 


B 


420732 
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1250 


F 


26379 


2845 


F 


809555 


4440 


B 
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1 1 C 1 
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F 


24476 


2846 


F 


812088 


4441 


B 
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i o<i 


T? 

r 


1 1 

27498 


2847 


F 
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4442 


B 


420461 
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T? 

r 
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F 


813225 
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B 
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r 
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F 
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B 
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T? 

r 
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F 
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B 
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F 
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F 
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4446 


B 


422265 


1 Kl 
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F 
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F 
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B 
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1 ICO 

1258 


F 
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F 
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B 
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F 
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F 
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B 
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r 
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B 
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B 


509202 


1477 


p 
r 


i i icon 
1 IZjoU 




i ft *> 
3022 


F 


893591 




4617 


B 


511062 


1478 


p 
r 


1 1 5 1 10 

1 Ol 3o 




1 ft *5 O 

3023 


F 


891684 




4618 


B 


510940 


147Q 


p 
r 


1 1 3lyo 




3024 


F 


894005 




4619 


B 


512832 


1430 


F 


1 1 5765 




1 07 c: 


r 


□ y z x z / 




4620 


B 


511747 


1431 


F 


L13891 




3026 


F 


B94827 




4621 


B . 


513649 


1432 


F 


119580 




3027 


F 


B92900 




4622 


B . 


512446 


1433 


F 


117660 




3028 


F 


B95732 




4623 


B ! 


514305 



WO 99/28475 
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ur. 


position 




ony XD 


Or . 


position 


SEQ ID 


Or. 


position 


1414 


F 








r? 
r 


893 790 


4624 


B 


513313 


1415 


F 


171 01 A 






r 


O Q ^ O O O 

896823 


4625 


B 


515212 


1416 


F 


174640 




0 a 0 n 
jUj! 


r 


894 y07 


4626 


B 


514529 


1417 


F 


1 777^1 






tr 
r 


y00571 


4627 


B 


516410 


1418 


F 






*3 n 1 1 


r 


898639 


4628 


B 


515466 


1419 


F 


171416 




1 n "3 /i 


r 


A A O VI A *7 

902407 


4629 


B 


517364 


1440 


F 


176101 






rr* 
C 


O A A C A O 

900507 


4630 


B 


515496 


1441 


F 


1 74708 




JUJb 


T71 

r 


A A *5 O >1 *3 

903243 


4631 


B 


517389 


1442 

1 Til 


F 


1 76871 




OUJ / 


r 


901346 


4632 


B 


516069 


1443 


F 


17S01 1 




0 uo 0 


r 


903616 


4633 


B 


517978 


1444 


F 


12760R 






r 


O A 1 *7 A £ 

901726 


4634 


B 


516642 


1445 


F 


1 75787 




7 n a n 

J 040 


r 


905486 


4635 


B 


518551 


1446 


F 


1 7046<; 




r 


a a "5 ron 

903589 


4636 


B 


517420 


1447 


r 




,5 042 


F 


906234 


4637 


B 


519349 


1448 


F 


i 3U /yy 




F 


904350 


4638 


B 


518187 


144Q 


F 




3 044 


r 


906774 


4639 


B 


520053 


1450 


F 


111615 


OA/ r 


r 


A A yl O A ^* 

904 846 


4640 


B 


518617 


1451 


F 


17071 1 
1 til 




r 


907868 


4641 


B 


520500 


1452 


F 


1 1785fi 


1 n 4 1 


T? 

r 


A A C A A it 

905924 


4642 


B 


519078 


1451 


F 


1 1001 A 


Irt/ Q 

JU4o 


r 


908501 


4643 


B 


520963 


1454 


F 


1 llAfii 


j 04 y 


T? 

r 


906583 


4644 


B 


519736 


1455 


F 
r 


1 1 1 A7A 






r 


908975 


4645 


B 


521636 


1456 


F 


111674 






tr 
r 


O A *7 A *7 A 

907079 


4646 


B 


520719 


1457 


F 


111706 




1 n 7 


IT 

r 


909351 


4647 


B 


522655 


1458 


F 


1 14185 




OUOJ 


T? 

r 


q A *7 >i rr 

907456 


4648 


B 


522221 


1459 


F 


1 17500 






c 
r 


A A A A *i r~ 

909835 


4649 


B 


524115 


1460 


F 


1 171 fil 






F 


907957 


4650 


B 


522354 


1461 


F 






JObo 


F 


910382 


4651 


B 


524287 


1462 


F 
r 


1 40 1 HA 




T A C *7 

3057 


F 


908496 


4652 


B 


523763 


1461 


F 






"5 A C O 

3058 


F 


910693 


4653 


B 


525689 


1464 


F 


140810 

I TVOJ7 




10,^0 


r 




4654 


B 


524854 


1465 


F 


138927 




3060 


F 


912169 


4655 


B « 


526756 


1466 


F 


141535 




3061 


F 


310248 


4656 


B \ 


525970 


1467 


F 


139614 




3062 


F . 


912376 


4657 


B ! 


527866 
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seo m 


Or 


position 


1468 


p 


147Q0Q 


l*TU7 


p 


140057 


1470 


P 

r 


14KQ4 
IHJOOH 


1471 


T? 

r 


1 A 1 TOO 


147? 


p 




147^ 


p 
r 


147175 


i*t / *T 


P 


14A1 7fi 


147S 


P 
r 




1476 


p 


14£fi04 


1477 


r 




147£ 


r 


14/ojo 


147Q 


p 
r 


1 450/^0 

i^oyou 


1 *tou 


p 
r 


1 48977 


1481 


p 
r 


1 4^147 


1482 


p 


14R7R1 


1483 


p 


1 yl^O/lz: 

l*K)o40 


1484 


p 
r 


1 4RG47 
140V4/ 


14R^ 


p 
r 


1 vt 7 AO 1 
14/UZl 


1486 


p 
r 


AQA1A 


1487 


p 
r 




1488 


p 


1 507AQ 

i ju /oy 


1489 


p 




14Q0 


p 
r 


c 1 741 


14Q1 
iHy l 


r 


1 /tQO OA 


14Q7 


p 


i jzo^y 


1493 
iiy J 


p 
r 


i ju /oy 


1494 


p 


1 511 Ol 


1495 


p 


1 c 1 77n 
I J 1Z /u 


1496 


p 


1 5171 0 

i jj / iy 


1407 


p 

r 


1 5 1 QCA 


1498 


P 
i 


1 SS007 


1499 


F 


153096 


1500 


F 


156550 


1501 


F ] 


[54687 



SEQ ID 


Or, 


position 




SEQ ID 


Or, 


p siti n 


*J A £ O 

3063 


F 


910476 




4658 


B 


526312 


3064 


F 


912984 




4659 


B 


528202 


3065 


F 


911084 




4660 


B 


526640 


3066 


F 


913437 




4661 


B 


528553 


3067 


F 


911545 




4662 


B 


526991 


3068 


F 


914282 




4663 


B 


528855 


3069 


F 


912376 




4664 


B 


528553 


3070 


F 


914925 




4665 


B 


530443 


3071 


F 


913023 




4666 


B 


529081 


3072 


F 


915394 




4667 


B 


530988 


3073 


F 


913510 




4668 


B 


529943 


3074 


F 


915827 




4669 


B 


531844 


3 075 


F 


913912 




4670 


B 


530424 


3076 


F 


916683 




4671 


B 


532301 


3077 


F 


914788 




4672 


B 


530799 


3078 


F 


917347 




4673 


B 


532675 


3079 


F 


915438 




4674 


B 


531670 


3080 


F 


918089 




4675 


B 


533594 


3081 


F 


916189 




4676 


B 


533498 


"5 a r> o 

3082 


F 


918399 




4677 


B 


535393 


3083 


F 


916506 




4678 


B 


534147 


3084 


F 


919296 




4679 


B 


535997 


3085 


F 


917406 




4680 


B 


534892 


3086 


F 


919457 




4681 


B 


536813 


3087 


F 


917598 




4682 


B 


536191 


3088 


F 


919864 




4683 


B 


538068 


3089 


F 


917963 




4684 


B 


539438 


T A ft A 

3090 


F 


920641 




4685 


B 


541306 


3091 


F 


918711 




4686 


B 


540771 


3092 


F 


921029 




4687 


B . 


542639 


1 0 01 

o u y j 


r 


?19138 




4688 


B ! 


541223 


3094 


F . 


921239 




4689 


B ! 


543141 


3095 


F ! 


319366 




4690 


B ! 


542025 


3096 


F < 


521526 




4691 


B ; 


543927 
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biLl<> 111 


dr. 


position 




SEQ ID 


Or. 


position 


SEQ ID 


Or. 


position 


1302 


P 

r 


157206 




3097 


F 


919638 


4692 


B 


543495 


1 ^ni 


c 
r 






3098 


F 


921930 


4693 


B 


545375 


1 S(\A 


p 
r 






O ft A A 

3 099 


F 


919979 


4694 


B 


544367 


1 

1 JuJ 


p 
r 


1 jOy&H 






F 


922212 


4695 


B 


546253 


I Jvu 


p 
r 


X jVO/O 




J 101 


F 


920325 


4696 


B 


544790 


1 507 


p 


1 ^770< 

i j / /io 




3102 


F 


922925 


4697 


B 


546697 


i 5nc 


P 


iouy>/ 




3103 


F 


921029 


4698 


B 


544982 


1 5no 


p 
r 


1 oyuoj 




3104 


F 


923258 


4699 


B 


546890 




p 
r 


1013 IV 




3105 


F 


921324 


4700 


B 


546655 


1 51 1 


p 
r 






3106 


F 


923808 


4701 


B 


548555 


1 51 ? 


p 
r 


L OZ 1 3 1 




3107 


F 


921929 


4702 


B 


547701 


10 ID 


p 
r 


1 <AO /I A 

1 0UZ4U 




3108 


F 


924165 


4703 


B 


549667 


1 51 A 


p 
r 


1 cine 

1 02775 




3109 


F 


922311 


4704 


B 


547609 


1 51 5 

1 J ID 


p 
r 


i £(\o/:c 
loOouj 




3110 


F 


924680 


4705 


B 


549533 


1 51£ 
1 J ID 


p 
r 


16423 0 




3111 


F 


922764 


4706 


B 


548121 


1517 


p 
r 


1 02345 




3112 


F 


925111 


4707 


B 


550040 


1 51 R 


p 
r 


1 /^on 
lOJOJ / 




3113 


F 


923258 


4708 


B 


548878 


1 51 Q 


p 
r 






3114 


F 


926538 


4709 


B 


550836 


1 570 


p 
r 


1003Uo 




3115 


F 


924638 


4710 


B 


549681 


1 591 


p 
r 


1 4ZA/Zf\C 

104oU,> 




3116 


F 


926972 


4711 


B 


551602 


1 57? 
1 «?x^ 


p 


lOOOlZ 




3117 


F 


925072 


4712 


B 


550605 


1 59^ 

l J 4. J 


p 
r 


lODOOJ 




3118 


F 


927351 


4713 


B 


552527 


1 594 


p 
r 


loioo/ 




3119 


F 


925415 


4714 


B 


551849 


1595 


p 
r 


10/430 




linn 

3120 


F 


927870 


4715 


B 


553750 


1 59£ 

1 J^O 


p 
r 


1 *7ACCiC 

1 /IDSo 




3121 


F 


925924 


4716 


B 


553261 


1 597 


p 
r 


looooi 




3122 


F 


928974 


4717 


B 


555165 




p 


1 /1U67 




3123 


F 


927031 


4718 


B . 


555176 


1 590 


p 
r 


1 ^G1 *71 

169173 




3124 


F 


930003 


4719 


B . 


557075 


1 53n 


r 


L 72090 




3125 


F . 


928103 


4720 


B « 


556590 


1 511 
1 J j 1 


r 


1 "7A 1*7 1 

L70171 




3126 


F . 


930383 


4721 


B i 


558489 


1532 


F 
i 


777Q7 








^28402 


4722 


B J 


557130 


1533 


F ] 


170904 




3128 


F < 


331084 


4723 


B 5 


559024 


1534 


F 


74176 




3129 


F < 


329222 


4724 


B 5 


558346 


1535 


F 


72236 




3130 


F < 


331307 


4725 


B I 


560246 
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DHi\£ UJ 




position 




5EQ XD 


Or . 


position 




SEQ ID 


Or. 


position 


IJjU 


p 

r 


I OU4o 






F 


929397 




4726 


B 


558455 


1 ^17 
1 JJ / 


r 


173138 




3132 


F 


931824 




4727 


B 


560376 


I J JO 


XT 

r 


175476 




3133 


F 


929927 




4728 


B 


559450 


1 ^10 


P 

r 


1 71 cnc 




3134 


F 


932352 




4729 


B 


561362 


1 JHu 


p 


1 771 CI 




J 1 JO 


r-i 

F 


930470 




4730 


B 


561581 


1 541 


p 


l 7C97C 




J lib 


r 


933044 




4731 


B 


563478 


1 547 

1 JHZ 


P 

r 


1 770CQ 




3 137 


F 


931084 




4732 


B 


563153 




r 


1 7<0/t0 

1 /jyHZ 




3138 


F 


933303 




4733 


B 


565073 


1 KAA 


p 
r 


1 7Q Q 1 O 




3139 


F 


931396 




4734 


B 


564319 


1 jHj 


p 


1 7*7001 

1 / 




O 1 A A 

3 140 


F 


933626 




4735 


B 


566220 


1 546 


p 
r 


1 CHjI 1 o 
1 5U4 1 z 




3 141 

■s 


F 


931686 


4736 


B 


564201 


1 547 


r 






J 14z 


F 


934320 


4737 


B 


566078 


1 54R 


p 


1011 £Q 

loi ioy 




F 


932412 


4738 


B 


567243 


1 540 


p 
r 


1 7077/C 

1 /yzzo 


1 1 /I /l 

3144 


F 


936427 


4739 


B 


569143 


i 


P 
r 


1 C7/C7Q 
loZOZO 


3145 


F 


934508 


4740 


B 


568192 


1 ^<;i 

Ljj 1 


r 


1 50694 


3146 


F 


938309 


4741 


B 


570133 


I J 3Z 


r 


i ci/ini 


3147 


F 


936402 


4742 


B 


570619 


1 D D J 


p 
r 


lol4y4 


3148 


F 


939110 


4743 


B 


572532 


1 554 


p 


1 C/l<77 

1 o4D / / 


3149 


F 


937204 


4744 


B 


572241 


1 555 


p 


lozozo 


*5 t C A 

3150 


F 


940791 


4745 


B 


574208 


1 556 


p 
r 


1 Q<7£1 
1 0 J / OO 


3151 


F 


938889 


4746 


B 


572994 


1 557 


p 
r 


1 QIC/I 0 


3 152 


F 


941806 


4747 


B 


574916 


1 558 


p 
r 


504^0 


3153 


F 


939906 


4748 


B 


573744 


1 55Q 


p 
r 


1 C/1A1 /I 
104014 




1 1 C A 

3 154 


F 


944314 




4749 


B 


575640 


1560 


p 


871 C7 
lo / 15 / 




3155 


F 


942412 




4750 


B 


573679 


1561 


p 


1 R^7A7 
1 oOZOZ 




J loo 


F 


944987 




4751 


B 


575571 




p 
r 


1 RS'wl'5 
155D4Z 




3157 


F 


943090 




4752 


B 


574398 


1 563 


p 


Q<£17 
1 OOOJZ 




3158 


F 


946072 




4753 


B 


576288 


1 SA4 


p 
r 


00/1 1 A 

1 oyn 1 U 




3159 


F 


944166 




4754 


B 


574677 


1 


*c 
r 


1 0*7 C 1 /I 

157514 




3160 


F 


946877 




4755 


B 


576633 


1566 


p 

i. 






J lo 1 


r 


944 986 




4756 


B 


575033 


1567 


F 


188083 




3162 


F 


948258 




4757 


B . 


576922 


1568 


F 


190545 




3163 . 


• F 


946403 




4758 


B ! 


575482 


1569 


F 1 


188666 




3164 


F 


949037 




4759 


B ! 


577363 
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ur, 


position 




id 


ur . 


position 




SEQ ID 


Or. 


position 


1 570 


F 
r 


101 S7fi 




7 i £c; 


IT 


Qyl 71 O -7 

y4 / 1 j / 




4760 


B 


575699 


1 571 

1*7/1 


F 
r 


1 OQCO^ 

loyDyO 




Jlob 


TT 

r 


Qyl OCD1 

y^yDoi 




4761 


B 


577606 


1 577 


F 


1 00 1 77 
1 yz 1 / J 




7 1 CH 


XT 


y4 /o/i 




4762 


B 


576078 


1 57** 
i j/j 


F 

r 


1 007/17 




J loo 


T? 

r 


o c n /i c c" 

yb0455 




4763 


B 


577993 


1 574 


F 


iyjui j 




j ioy 


rp 

F 


r\ a n r r n 

948558 




4764 


B 


578265 


1 575 


p 
r 


iy 1 I jj 




jl / u 


r 


351058 




4765 


B 


580143 


1 576 


F 
r 


i y44 / 1 




.31/1 


T? 
f 


949151 




4766 


B 


578948 


1 577 


F 
r 


1 G7<77 




linn 


F 


951569 




4767 


B 


580848 


1 578 


F 


1 Q^O/IA 




,51 / j 


TP 

r 


94 9653 




4768 


B 


582336 


1 570 
1 j 


F 






■J i 7/i 
Jl / 4 


rp 
F 


953340 




4769 


B 


584225 


i 5so 


F 


iyo/yo 




Jl /b 


F 


951431 


4770 


B 


582917 


1 SRI 

1 JO 1 


F 
r 


iy4oyo 


Jl / u 


rp 

F 


954174 


4771 


B 


584317 


1587 

1 JOZ 


F 
r 




Jl / / 


r 


352280 


4772 


B 


583359 


1 58^ 


F 


iyjj ju 


J 1 / 0 


r 


955475 


4773 


B 


585252 


1584 

1 J 0*T 


F 


1 Q7440 
1 7 /44U 


ji/y 


r 


953604 


4774 


B 


583734 


1 585 

1 JO J 


F 
r 


iyjj4y 


■j i on 
J 1 0 u 


TP 

r 


957601 


4775 


B 


585634 


1 586 

1JOU 


F 


iye/jo 


J 1 0 1 


r 


955703 


4776 


B 


584122 


1 587 


F 


1 QAS07 

lyosuz 


J loz 


rp 

r 


959053 


4777 


B 


585990 


1588 

1 JOO 


F 


1 00777 

iyy /zz 


Tin-) 
J lb J 


F 


957168 


4778 


B 


584665 


1589 


r 


1 Q7C77 
1 7 /oZZ 


J 1 04 


r 


960824 


4779 


B 


586598 


1 590 


F 


700007 


1 1 or 

Jlob 


rp 
r 


958932 


4780 


B 


585213 


1 5Q1 

1 J7 1 


F 
r 


1 QO 1 >17 

iyo 14/ 


J loo 


F 


961268 


4781 


B 


587123 


1 59? 


F 
r 


7007 a 1 


J 1 O / 


F 


9593 68 


4782 


B 


585517 


1 59^ 


F 
r 


iyo4Do 




*3 1 O Q 

J loo 


F 


961365 




4783 


B 


587434 


1 594 


F 
r 


7000^1^ 




olo9 


F 


959465 




4784 


B 


586850 


1595 


F 
r 


1 oonno 
lyyuuy 




jiy u 


TP 

F 


962324 




4785 


B 


588730 


1596 


F 
r 


707 1 77 
ZUZ IZZ 




j iy 1 


rp 
F 


960423 




4786 


B 


588017 


1597 


F 


ZV/UZ1 J 




7 1 QO 

j iy z 


t? 

r 


964706 




4787 


B 


589878 


1598 


F 
r 


">077^ 1 
dUJZ J 1 




j iy j 


rp 

F 


962822 




4788 


B 


589628 


1 599 

uyy 


F 

r 


■>rki 7^7 
tUi 33Z 




3194 


F 


965974 




4789 


B 


591543 


1600 


F , 


203807 




3195 


F 


964074 




A 7 on 




boy /yo 


1601 


f : 


201903 




3196 


F 


967210 




4791 


B j 


591723 


1602 


f ; 


206555 




3197 


F 


965307 




4792 


B ! 


590323 


1603 


f ; 


204669 




3198 


F 


967808 




4793 


B I 


592211 



WO 99/28475 
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SEQID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1604 


F 


207269 




3199 


F 


965925 




4794 


B 


591492 


1605 


F 


205369 




3200 


F 


969286 




4795 


B 


593419 


1606 


F 


208293 




3201 


F 


967320 




4796 


B 


593002 


1607 


F 


206395 




3202 


F 


970802 




4797 


B 


594885 


1608 


F 


209252 




3203 


F 


968888 




4798 


B 


593367 


1609 


F 


207345 




3204 


F 


972169 




4799 


B 


595321 


1610 


F 


210330 




3205 


F 


970269 




4800 


B 


594166 


loll 


F 


208414 




3206 


F 


973487 




4801 


B 


596020 


1 £ 1 

1612 


F 


210632 




3207 


F 


971616 


4802 


B 


595942 


16 13 


F 


208694 


3208 


F 


974339 


4803 


B 


597826 


1614 


F 


211151 


3209 


F 


972408 


4804 


B 


596354 


1 1 c 

1 61 5 


F 


209255 


3210 


F 


974988 


4805 


B 


598255 


1616 


F 


212650 


3211 


F 


973035 . 


4806 


B 


597147 


1617 


F 


210756 


3212 


F 


976035 


4807 


B 


598998 


1 61 8 


F 


213920 


3213 


F 


974114 


4808 


B 


597960 


1619 


F 


212036 


3214 


F 


976367 


4809 


B 


599851 


1620 


F 


214535 


3215 


F 


974411 


4810 


B 


601068 


1621 


F 


212635 


3216 


F 


976665 


4811 


B 


602929 


1622 


F 


215003 


3217 


F 


974730 


4812 


B 


602096 


1623 


F 


213077 


3218 


F 


977439 


4813 


B 


603996 


1624 


F 


216641 


3219 


F 


975500 


4814 


B 


603761 


1625 


F 


214772 


3220 


F 


977698 


4815 


B 


605643 


1626 


F 


216869 


3221 


F 


975799 


4816 


B 


604014 


1627 


F 


214961 




3222 


F 


978389 


4817 


B 


605920 


1 1:10 

1628 


F 


218145 




3223 


F 


976478 




4818 


B 


604634 


1629 


F 


216218 




3224 


F 


978665 




4819 


B 


606548 


1630 


F 


218461 




3225 


F 


976760 




4820 


B 


605864 


1631 


F 


216576 




3226 


F 


979134 




4821 


B 


607736 


i y 1 1 

1632 


F 


218960 




3227 


F 


977270 




4822 


B 


606903 


1633 


F 


217044 




3228 


F 


979473 




4823 


B 


608742 


1 fi^d 

1 DJH 


r 








F 


977592 




4824 


B 


607722 


1635 


F 


1X1112 




3230 


F 


980915 




4825 


B 


609674 


1636 


F 


220257 




3231 


F 


979026 




4826 


B 


609329 


1637 


F 


218379 




3232 


F 


982043 




4827 


B 


611215 



WO 99/28475 
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1JJ 


Ur. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or, 


position 




u 
r 


220903 




O O *5 *5 

3233 


F 


980178 




4828 


B 


609916 


1639 


F 


218989 




3234 


F 


983628 




4829 


B 


611866 


1 HAf\ 

1640 


F 


221314 




3235 


F 


981736 




4830 


B 


612090 


1 HA 1 

1641 


F 


219470 




3236 


F 


984149 




4831 


B 


613998 


1642 


F 


222253 




3237 


F 


982271 




4832 


B 


613166 


1 HA*\ 


T7 

r 


220338 




•5 O O O 

3238 


F 


985180 




4833 


B 


615091 


1 A/l/i 


r 


223186 




3239 


F 


983280 




4834 


B 


613838 


1 HA* 
1045 


r 


OOI OTO 

2212/0 




3240 


F 


985815 




4835 


B 


615761 


1 A4A 
lo40 


T? 

r 


223994 




3241 


F 


983882 




4836 


B 


614217 


1 HA H 

164/ 


T? 

r 


222146 


3242 


F 


986458 


4837 


B 


616094 


1 HA Q 

1648 


F 


iijI a a o 

224908 


3243 


F 


984547 


4838 


B 


615464 


1 HACk 

1 64y 


tr 
r 


A 1 A 

223014 


3244 


F 


987340 


4839 


B 


617391 


1 A<A 


t? 
r 


225051 


3245 


F 


985462 


4840 


B 


615913 


1 A*\ 1 


c 
r 


223131 


3246 


F 


987686 


4841 


B 


617803 


1 ACO 
1 05Z 


r 


OOC C 1 A 

225510 


3247 


F 


985815 


4842 


B 


617932 


1 H<1 

1653 


F 


223615 


3248 


F 


988559 


4843 


B 


619837 


1 HZ A 

1654 


F 


226550 


3249 


F 


986670 


4844 


B 


618598 


1655 


F 


^o ^ /on 

224609 


3250 


F 


989455 


4845 


B 


620532 


i A«.A 
1656 


F 


226928 


3251 


F 


987558 


4846 


B 


619591 




T? 

r 


225029 


3252 


F 


993122 


4847 


B 


621465 


1 AC.8 
1035 


r 


227528 


3253 


F 


991289 


4848 


B 


620636 


1 A*aG 


T7 

r 


225631 


3254 


F 


993186 


4849 


B 


622536 


1 AAA 


r 


000*5 00 

228388 


3255 


F 


991297 


4850 


B 


620532 


1 AA1 


F 


226475 


3256 


F 


993650 


4851 


B 


622381 


1 AAO 


F 


229930 


3257 


F 


991727 


4852 


B 


621404 


1 AA7 


r 


oo onoo 
228032 


3258 


F 


994007 




4853 


B 


623307 


1 AA/1 


r 


Ol 1 1 OA 

231129 




3259 


F 


992107 




4854 


B 


621932 


1 AA^ 


1? 

r 


OOQOAC 

229205 




3260 


F 


995743 




4855 


B 


623834 


1 AAA 


T 

r 


232785 




3261 


F 


993915 




4856 


B 


622666 


1 AA*7 

166/ 


F 


230915 




3262 


F 


996663 




4857 


B 


624588 


1668 




7^^SA1 




7,0 A** 


T? 

r 


yy4 /o3 




4858 


B 


623689 


1669 


F 


231664 




3264 


F 


998586 




4859 


B 


625605 


1670 


F 


234013 




3265 


F 


996755 




4860 


B 


624435 


1671 


F 


232149 




3266 


F 


999153 




4861 


B 


626298 



WO 99/28475 
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5>EQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or, 


position 


lo/z 


r 


234942 




3267 


F 


997253 




4862 


B 


625204 


1 £*7*5 


r 


233061 




3268 


F 


1000967 




4863 


B 


627128 


1 fJlA 
lO /4 


r 


236015 




3269 


F 


999092 




4864 


B 


626885 


lO/J 


r 


234123 




3270 


F 


1001173 




4865 


B 


628790 


1 ana 


F 


237945 




3271 


F 


999246 




4866 


B 


627128 


It)/ / 


F 


236045 




3272 


F 


1001604 




4867 


B 


629026 


1678 


F 


238482 




3273 


F 


999645 




4868 


B 


628073 


io/y 


r 


236599 




3274 


F 


1004159 




4869 


B 


629983 


lool) 


r 


240094 




3275 


F 


1002326 




4870 


B 


628359 


IDS 1 


r 


HOI flfk 

238190 




3276 


F 


1004763 




4871 


B 


630267 




r 


241713 




3277 


F 


1002871 




4872 


B 


628976 


1 /CCO 

loo3 


r 


239820 


3278 


F 


1005160 


4873 


B 


63085C 


1684 


F 


242569 


3279 


F 


1003235 


4874 


B 


630023 


loo3 


T? 

r 


240653 


3280 


F 


1007181 


4875 


B 


631988 


looo 


IT" 

F 


244253 


3281 


F 


1005250 


4876 


B 


630642 


105/ 


r 


242360 


3282 


F 


1007561 


4877 


B 


632526 


looo 


r 


245693 


3283 


F 


1005665 


4878 


B 


631205 


1 ACQ 

looy 


F 


243796 


3284 


F 


1008855 


4879 


B 


633081 


loyu 


F 


246762 


3285 


F 


1007002 


4880 


B 


632046 


loy l 


r 


244825 


3286 


F 


1010205 


4881 


B 


633969 


1 ^GO 

loVz 


F 


247498 


3287 


F 


1008342 


4882 


B 


638311 


loyj 


F 


245575 


3288 


F 


1011716 


4883 


B 


640204 




•c 
r 


248343 


3289 


F 


1009823 


4884 


B 


640448 


loy j 


F 


/f £. A A A 

246444 




3290 


F 


1011812 




4885 


B 


642328 


loyo 


F 


1 /lAf An 

249500 




3291 


F 


1009914 




4686 


B 


643695 


1 AG7 

ioy / 


t 
r 


247625 




3292 


F 


1012372 




4887 


B 


645639 


loyo 


F 


250315 




3293 


F 


1010385 




4888 


B 


640847 


ioyy 


r 


248425 




3294 


F 


1012567 




4889 


B 


642769 


l /uu 


F 


250832 




3295 


F 


1010624 




4890 


B 


S44745 


1 /Ul 


F , 


248942 




3296 


F 


1013237 




4891 


B 


546615 




r 


d j 1 84 / 




*5 O O *7 

32 97 


F 


1011337 




4892 


B 


545686 


1703 


f : 


249939 




3298 


F 


1013690 




4893 


B i 


547558 


1704 


f : 


254897 




3299 


F 


1011856 




4894 


B ( 


546060 


1705 


f : 


252955 




3300 


F 


1014301 




4895 


B t 


547972 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1706 


F 


256543 




3301 


F 


1012396 




4696 


B 


647331 


1707 


F 


254643 




3302 


F 


1014926 




4897 


B 


649231 


1708 


F 


257692 




3303 


F 


1013010 




4898 


B 


649987 


1709 


F 


255790 




3304 


F 


1015664 




4899 


B 


651829 


1710 


F 


258561 




3305 


F 


1013820 




4900 


B 


650580 


1711 


F 


256651 




3306 


F 


1017026 




4901 


B 


652484 


1712 


F 


258927 




3307 


F 


1015099 




4902 


B 


651942 


1713 


F 


257036 




3308 


F 


1017674 




4903 


B 


653852 


1714 


F 


261368 




3309 


F 


1015786 




4904 


B 


652395 


1715 


F 


259469 




3310 


F 


1018353 




4905 


B 


654310 


1716 


F 


263887 




3311 


F 


1016460 


4906 


B 


653132 


1717 


F 


262000 


3312 


F 


1019602 


4907 


B 


655028 


1718 


F 


264503 


3313 


F 


1017674 


4908 


B 


653827 


1719 


F 


262599 


3314 


F 


1019876 


4909 


B 


655713 


1720 


F 


265364 


3315 


F 


1017948 


4910 


B 


662071 


1721 


F 


263512 


3316 


F 


1020853 


4911 


B 


664023 


1722 


F 


266202 


3317 


F 


1018956 


4912 


B 


662543 


1723 


F 


264277 


3318 


F 


1021878 


4913 


B 


664403 


1724 


F 


266709 


3319 


F 


1019972 


4914 


B 


663295 


1725 


F 


264801 


3320 


F 


1023054 


4915 


B 


665205 


1726 


F 


267847 


3321 


F 


1021186 


4916 


B 


663972 


1727 


F 


265947 


3322 


F 


1023415 




4917 


B 


665850 


1728 


F 


267980 


3323 


F 


1021579 




4918 


B 


664432 


1729 


F 


266077 




3324 


F 


1023748 




4919 


B 


666332 


1730 


F 


268271 




3325 


F 


1021850 




4920 


B 


665860 


1731 


F 


266341 




3326 


F 


1024485 




4921 


B 


667789 


1732 


F 


269840 




3327 


F 


1022574 




4922 


B 


666312 


1733 


F , 


267913 




3328 


F 


1024744 




4923 


B 


668233 


1734 


F : 


270961 




3329 


F 


1022836 




4924 


B 


666652 


1735 


f ; 


269072 




3330 


F 


1025618 




4925 


B 


668550 


1 TX< 
I /DO 


r 


271883 




3331 


F 


1023720 




4926 


B 


668338 


1737 


f : 


£70080 




3332 


F 


1026323 




4927 


B 


570238 


1738 


f : 


272642 




3333 


F 


1024403 




4928 


B 


568605 


1739 


f ; 


270748 




3334 


F ' 


L027710 




4929 


B 1 


570495 
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244 



PCT71B98/01939 



SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


p sition 


1740 


F 


273367 




3335 


F 


1025809 




4930 


B 


668690 


1741 


F 


271477 




3336 


F 


1030272 




4931 


B 


670590 


1742 


F 


274562 




3337 


F 


1028389 




4932 


B 


669766 


1743 


F 


272702 




3338 


F 


1031486 




4933 


B 


671653 


1744 


F 


275882 




3339 


F 


1029602 




4934 


B 


670160 


1745 


F 


273984 




3340 


F 


1033215 




4935 


B 


672109 


1746 


F 


278004 




3341 


F 


1031334 




4936 


B 


671000 


1747 


F 


276149 




3342 


F 


1035425 




4937 


B 


672900 


1748 


F 


278747 


3343 


F 


1033555 




4938 


B 


671470 


1749 


F 


276893 


3344 


f 


1035956 


4939 


B 


673412 


1750 


F 


279521 


3345 


F 


1034055 


4940 


B 


672685 


1751 


F 


277632 


3346 


F 


1036748 


4941 


B 


674567 


1752 


F 


281076 


3347 


f 


1034844 


4942 


B 


673461 


1753 


F 


279118 


3348 


f 


16372 


4943 


B 


675365 


1754 


F 


281551 


3349 


f 


14463 


4944 


B 


674786 


1755 


F 


279668 


3350 


f 


31184 


4945 


B 


676682 


1756 


F 


282573 


3351 


f 


29287 


4946 


B 


675456 


1757 


F 


280663 


3352 


f 


56283 


4947 


B 


677375 


1758 


F 


284229 


3353 


f 


54383 


4948 


B 


676683 


1759 


F 


282316 


3354 


F 


56384 


4949 


B 


678594 


1760 


F 


284598 


3355 


F 


54538 


4950 


B 


677334 


1761 


F 


282655 


3356 


F 


64528 


4951 


B 


679183 


1762 


F 


285418 


3357 


F 


62600 


4952 


B 


678726 


1763 


F 


283518 




3358 


F 


72965 




4953 


B 


680596 


1764 


F 


286104 




3359 


F 


71054 




4954 


B 


679729 


1765 


F 


284229 




3360 


F 


78245 




4955 


B 


681628 


1766 


F 


286456 




3361 


F 


76347 




4956 


B 


680747 


1767 


F 


284531 




3362 


F 


79133 




4957 


B 


682668 


1768 


F 


287865 




3363 


F 


77291 




4958 


B 


681500 


1769 


F 


286008 




3364 


F 


81740 




4959 


B 


683406 


1 77ft 


r 


")QQ1 

toy loJ 




3365 


F 


79840 




4960 


B 


682779 


MIX 


F , 


287384 




3366 


F 


86772 




4961 


B 


684716 


mi 


f ; 


290609 




3367 


F 


84880 




4962 


B 


683320 


1773 


f ; 


288709 




3368 


F 


109188 




4963 


B 


585249 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1774 


F 


291264 




3369 


F 


107337 




4964 


B 


684716 


1775 


F 


289389 




3370 


F 


111132 




4965 


B 


686565 


1776 


F 


292107 




3371 


F 


109188 




4966 


B 


685010 


1777 


F 


290166 




3372 


F 


111505 




4967 


B 


686897 


1778 


F 


293099 




3373 


F 


109597 




4968 


B 


686423 


1779 


F 


291211 




3374 


F 


112432 




4969 


B 


688323 


1780 


F 


294791 




3375 


F 


110462 




4970 


B 


687426 


1781 


F 


292883 


3376 


F 


113446 




4971 


B 


689324 


1782 


F 


295464 


3377 


F 


111592 




4972 


B 


688619 


1783 


F 


293573 


3378 


F 


120225 


4973 


B 


690482 


1784 


F 


296018 


3379 


F 


118303 


4974 


B 


688653 


1785 


F 


294095 


3380 


F 


124892 


4975 


B 


690563 


1786 


F 


297572 


3381 


F 


123004 


4976 


B 


689836 


1787 


F 


295664 


3382 


F 


131327 


4977 


B 


691775 


1788 


F 


298686 


3383 


F 


129485 


4978 


B 


690186 


1789 


F 


296716 


3384 


F 


143944 


4979 


B 


692088 


1790 


F 


300305 


3385 


F 


142043 


4980 


B 


690715 


1791 


F 


298407 


3386 


F 


150138 


4981 


B 


692616 


1792 


F 


301852 


3387 


F 


148247 


4982 


B 


690937 


1793 


F 


299946 


3388 


F 


163715 


4983 


B 


692837 


1794 


F 


304754 


3389 


F 


161804 


4984 


B 


692091 


1795 


F 


302849 


3390 


F 


165186 


4985 


B 


693991 


1796 


F 


305854 


3391 


F 


163274 


4986 


B 


694171 


1797 


F 


303992 


3392 


F 


168143 




4987 


B 


696078 


1798 


F 


306214 


3393 


F 


166302 




4988 


B 


695197 


1799 


F 


304303 




3394 


F 


170287 




4989 


B 


697093 


1800 


F 


306758 




3395 


F 


168387 




4990 


B 


697486 


1801 


F 


304856 




3396 


F 


176838 




4991 


B 


699428 


1802 


F 


309057 




3397 


F 


174996 




4992 


B 


698313 


1803 


F 


307125 




3398 


F 


187776 




4993 


B 


700238 




F 


1 Art /*i c 

309635 




3399 


F 


185900 




4994 


B 


698646 


1805 


F 


307750 




3400 


F 


188083 




4995 


B 


700515 


1806 


F 


310491 




3401 


F 


186208 




4996 


B 


700337 


1807 


F 


308597 




3402 


F 


190117 




4997 


B 


702249 



WO 99/28475 
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OT7 1 /^V TI\ 

SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or, 


position 


1808 


F 


311753 




3403 


F 


188168 




4998 


B 


701115 


1809 


F 


309790 




3404 


F 


196802 




4999 


B 


703015 


1 O 1 A 

1810 


F 


313188 




3405 


F 


194946 




5000 


B 


702385 


1011 

151 1 


F 


31 1292 




3406 


F 


210685 




5001 


B 


704285 


i oil 


F 


314121 




3407 


F 


208785 




5002 


B 


703636 


1813 


r 


312194 




3408 


F 


234633 




5003 


B 


705561 


1 O 1 A 

1814 


F 


314489 




3409 


F 


232727 




5004 


B 


705271 


1 O 1 c 

1815 


F 


312539 




3410 


F 


236682 




5005 


B 


707136 


101/ 
1816 


F 


315431 




3411 


F 


234794 


5006 


B 


705875 


ion 

1817 


F 


313526 


3 412 


F 


249227 


5007 


B 


707725 


1818 


F 


316309 


3413 


F 


247310 


5008 


B 


706444 


1 O 1 A 

loiy 


F 


314380 


3414 


F 


252939 


5009 


B 


708279 


1 OTA 

1820 


F 


317102 


3415 


F 


251036 


5010 


B 


706741 


1821 


F 


315214 


3416 


F 


253406 


5011 


B 


708673 


1822 


F 


317271 


3417 


F 


251562 


5012 


B 


708324 


1823 


F 


315343 


3418 


F 


271365 


5013 


B 


710226 


1824 


F 


317380 


3419 


F 


269466 


5014 


B 


708673 


1825 


F 


315480 


3420 


F 


275390 


5015 


B 


710518 


1 OO/C 

1826 


F 


318256 


3421 


F 


273489 


5016 


B 


708876 


loz/ 




316352 


3422 


F 


277681 


5017 


B 


710791 


1828 


F 


319047 


3423 


F 


275765 


5018 


B 


710498 


1829 


F 


317181 


3424 


F 


282260 


5019 


B 


712447 


1 C3 A 

1830 


F 


320325 


3425 


F 


280357 


5020 


B 


711435 


1 Q1 1 

183 1 


F 


318338 


3426 


F 


292925 


5021 


B 


713354 


1 CIO 

lo32 


T7 

r 


321228 


3427 


F 


291054 


5022 


B 


712993 


1 C37 


-c 
r 


0 1 ao ac 
319366 


3428 


F 


302910 


5023 


B 


714887 


1 Q1 A 


F 


321676 


3429 


F 


301032 




5024 


B 


713686 


1835 


F 


319782 




3430 


F 


308746 




5025 


B 


715574 


1836 


F 


322066 




3431 


F 


306806 




5026 


B 


714474 


1837 


F 


320097 




3432 


F 


311994 




5027 


B 


716354 


1 JHR 


r 






*3 A *5 *5 


F 


310073 




5028 


B 


714867 


1839 


F 


320982 




3434 


F 


312375 




5029 


B 


716760 


1840 


F 


324744 




3435 


F 


310483 




5030 


B 


716047 


1841 


F 


322849 




3436 


F 


312531 




5031 


B 


717877 
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SEQ ID 


Or, 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1842 


F 


325392 




3437 


F 


310647 




5032 


B 


716086 


1843 


F 


323445 




3438 


F 


319923 




5033 


B 


717976 


1 O A A 

1844 


F 


326217 




3439 


F 


318009 




5034 


B 


717189 


1 O A £ 

1845 


F 


324331 




3440 


F 


339991 




5035 


B 


719068 


1846 


F 


327038 




3441 


F 


338104 




5036 


B 


718624 


1847 


F 


325162 




3442 


F 


352535 




5037 


B 


720503 


1848 


F 


327957 




3443 


F 


350653 




5038 


B 


719083 


1849 


F 


326079 




3444 


F 


373218 




5039 


B 


720983 


1850 


F 


328458 


3445 


F 


371320 


5040 


B 


720047 


1851 


F 


326612 


3446 


F 


376994 


5041 


B 


722004 


1852 


F 


329032 


3447 


F 


375085 


5042 


B 


720503 


1853 


F 


327173 


3448 


F 


378954 


5043 


B 


722393 


1854 


F 


329329 


3449 


F 


377011 


5044 


B 


720753 


1855 


F 


327489 


3450 


F 


394604 


5045 


B 


722653 


1 or r 

1856 


F 


330446 


3451 


F 


392704 


5046 


B 


721798 


1 O C 1 

1857 


F 


328551 


3452 


F 


400915 


5047 


B 


723724 


i o r o 

1858 


F 


« o f\f\ \ P 

330915 


3453 


F 


398972 


5048 


B 


722631 


1 O CA 

1859 


F 


329032 


3454 


F 


409744 


5049 


B 


724493 


I860 


F 


331410 


3455 


F 


407904 


5050 


B 


723468 


1 Of 1 

1861 


F 


329602 


3456 


F 


411155 


5051 


B 


725376 


1862 


F 


332534 


3457 


F 


409253 


5052 


B 


724852 


1863 


F 


330626 


3458 


F 


414197 


5053 


B 


726743 


1864 


F 


332782 


3459 


F 


412281 




5054 


B 


726005 


1865 


F 


330879 


3460 


F 


422638 




5055 


B 


727903 


1866 


F 


333587 




3461 


F 


420770 




5056 


B 


726779 


1867 


F 


331632 




3462 


F 


427595 




5057 


B 


728691 


1868 


F 


333870 




3463 


F 


425701 




5058 


B 


727058 


1869 


F 


331962 




3464 


F 


428453 




5059 


B 


728947 


1 OTA 

1870 


F 


334510 




3465 


F 


426553 




5060 


B 


727727 


10*71 

1871 


F 


332594 




3466 


F 


442272 




5061 


B 


729613 


1 R79 


r 








t? 
r 


A A (\ ~\ f? A 

440364 




5062 


B 


728224 


1873 


F 


333049 




3468 


F 


443303 




5063 


B 


730116 


1874 


F . 


334958 




3469 


F 


441380 




5064 


B 


729048 


1875 


F . 


333049 




3470 


F 


442939 




5065 


B 


730907 



WO 99/28475 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


p sition 


1876 


F 


335655 




3471 


F 


441047 




5066 


B 


729566 


1877 


F 


333766 




3472 


F 


445572 




5067 


B 


731468 


1 OTO 

1878 


F 


336117 




3473 


F 


443707 




5068 


B 


732909 


1879 


F 


334219 




3474 


F 


467757 




5069 


B 


734770 


1 ODA 

1880 


F 


337108 




3475 


F 


465801 




5070 


B 


734663 


1 OO 1 

1881 


F 


335210 




3476 


F 


471583 




5071 


B 


736569 


1882 


F 


340251 




3477 


F 


469712 




5072 


B 


735879 


1883 


F 


338372 




3478 


F 


487813 




5073 


B 


737785 


1884 


F 


341538 




3479 


F 


485913 




5074 


B 


736724 


1 nor 

1885 


F 


339662 




3480 


F 


496852 


5075 


B 


738632 


too/ 

1886 


F 


341953 


3481 


F 


494952 


5076 


B 


737474 


1887 


F 


339995 


3482 


F 


499979 


5077 


B 


739421 


1888 


F 


342348 


3483 


F 


498074 


5078 


B 


738007 


1889 


F 


340450 


3484 


F 


508715 


5079 


B 


739907 


1890 


F 


343112 


3485 


F 


506798 


5080 


B 


738911 


1 on 1 

1891 


F 


341242 


3486 


F 


510584 


50B1 


B 


740799 


1892 


F 


343736 


3487 


F 


508632 


5082 


B 


739960 


1893 


F 


341811 


3488 


F 


526255 


5083 


B 


741908 


1894 


F 


344117 


3489 


F 


524350 


5084 


B 


742277 


1 one 

1895 


F 


342207 


3490 


F 


531098 


5085 


B 


744187 


1896 


F 


344940 


3491 


F 


529150 


5086 


B 


743089 


1 00*7 

1897 


F 


343000 


3492 


F 


556575 


5087 


B 


744989 


1 ono 

1898 


F 


345837 


3493 


F 


554706 


5088 


B 


743603 


1899 


F 


A A /■% a mm r% 

343958 


3494 


F 


564318 




5089 


B 


745539 


1900 


F 


346872 


3495 


F 


562390 




5090 


B 


744565 


1901 


F 


344994 




3496 


F 


566692 




5091 


B 


746432 


1902 


F 


347910 




3497 


F 


564838 




5092 


B 


744977 


1903 


F 


345971 




3498 


F 


570033 




5093 


B 


746867 


1904 


F 


350124 




3499 


F 


568150 




5094 


B 


745249 


1905 


F 


348298 




3500 


F 


570844 




5095 


B 


747138 


1906 


r 






JDUl 


r 


c c o o n c 

568915 




5096 


B 


745777 


1907 


F 


349167 




3502 


F 


575571 




5097 


B 


747677 


1908 


F . 


351996 




3503 


F 


573671 




5098 


B 


746632 


1909 


f : 


350122 




3504 


F 


590045 




5099 


B 


748532 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1910 


F 


353051 




3505 


F 


588196 




5100 


6 


747054 


1911 


F 


351186 




3506 


F 


597631 




5101 


B 


748893 


1912 


F 


353413 




3507 


F 


595698 




5102 


B 


748519 


1913 


F 


351481 




3508 


F 


606387 




5103 


B 


750396 


1914 


F 


353908 




3509 


F 


604507 




5104 


B 


749186 


1915 


F 


351996 




3510 


F 


607566 




5105 


B 


751108 


1916 


F 


354723 




3511 


F 


605637 




5106 


B 


749646 


1917 


F 


352799 




3512 


F 


609842 




5107 


B 


751546 


1918 


F 


356466 




3513 


F 


607958 




5108 


B 


749922 


1919 


F 


354569 


3514 


F 


632472 


5109 


B 


751824 


1920 


F 


357107 


3515 


F 


630572 


5110 


B 


750260 


1921 


F 


355178 


3516 


F 


636994 


5111 


B 


752151 


1922 


F 


357767 


3517 


F 


635071 


5112 


B 


752527 


1923 


F 


355878 


3518 


F 


649681 


5113 


B 


754427 


1924 


F 


360528 


3519 


F 


647800 


5114 


B 


753169 


1925 


F 


358628 


3520 


F 


652059 


5115 


B 


755064 


1926 


F 


360877 


3521 


F 


650101 


5116 


B 


755004 


1927 


F 


358974 


3522 


F 


654522 


5117 


B 


756843 


1928 


F 


361573 


3523 


F 


652562 


5118 


B 


757034 


1929 


F 


359692 


3524 


F 


660587 


5119 


B 


758991 


1930 


F 


362584 


3525 


F 


658691 


5120 


B 


758532 


1931 


F 


360681 


3526 


F 


676785 


5121 


B 


760452 


1932 


F 


363835 


3527 


F 


674938 


5122 


B 


758911 


1933 


F 


361966 


3528 


F 


679031 


5123 


B 


760841 


1934 


F 


364960 


3529 


F 


677133 


5124 


B 


760015 


1935 


F 


363021 




3530 


F 


731967 


5125 


B 


761913 


1936 


F 


365240 




3531 


F 


730091 




5126 


B 


760463 


1937 


F 


363360 




3532 


F 


741797 




5127 


B 


762363 


1938 


F 


367060 




3533 


F 


739935 




5128 


B 


760782 


1939 


F 


365115 




3534 


F 


758555 




5129 


B 


762671 




r 


368383 




3535 


F 


756641 




5130 


B 


762053 


1941 


F 


366505 




3536 


F 


760010 




5131 


B 


763911 


1942 


F . 


368862 




3537 


F 


758082 




5132 


B 


762363 


1943 


f : 


366963 




3538 


F 


770670 




5133 


B 


764264 
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SEQID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1944 


F 


370513 




3539 


F 


768751 




5134 


B 


763203 


1945 


F 


368631 




3540 


F 


771896 




5135 


B 


765107 


1946 


F 


370974 




3541 


F 


769996 




5136 


B 


764690 


1947 


F 


369076 




3542 


F 


787857 




5137 


B 


766595 


1948 


F 


372891 




3543 


F 


785958 




5138 


B 


765107 


1949 


F 


370980 




3544 


F 


815714 


5139 


B 


766977 


1950 


F 


373395 




3545 


F 


813840 




5140 


B 


766327 


1951 


F 


371495 




3546 


F 


846380 




5141 


B 


768221 


1952 


F 


374005 




3547 


F 


844470 


5142 


B 


766932 


1953 


F 


372033 


3548 


F 


867576 


5143 


B 


768851 


1954 


F 


374474 


3549 


F 


865673 


5144 


B 


768314 


1955 


F 


372572 


3550 


F 


875167 


5145 


B 


770221 


1956 


F 


376509 


3551 


F 


873254 


5146 


B 


769045 


1957 


F 


374624 


3552 


F 


876214 


5147 


B 


770945 


1958 


F 


377630 


3553 


F 


874314 


5148 


B 


770315 


1959 


F 


375708 


3554 


F 


884093 


5149 


B 


772234 


1960 


F 


378384 


3555 


F 


882162 


5150 


B 


770705 


1961 


F 


376507 


3556 


F 


891248 


5151 


B 


772598 


1962 


F 


378798 


3557 


F 


889348 


5152 


B 


770882 


1963 


F 


376871 


3558 


F 


900125 


5153 


B 


772781 


1964 


F 


379413 


3559 


F 


898298 


5154 


B 


771156 


1965 


F 


377501 


3560 


F 


902048 


5155 


B 


773044 


1966 


F 


379890 


3561 


F 


900125 


5156 


B 


772234 


1967 


F 


377989 




3562 


F 


907563 


5157 


B 


774148 


1968 


F 


381241 




3563 


F 


905656 


5158 


B 


773611 


1969 


F 


379348 




3564 


F 


912076 




5159 


B 


775511 


1970 


F 


382485 




3565 


F 


910133 




5160 


B 


774513 


1971 


F 


380579 




3566 


F 


935157 




5161 


B 


776404 


1972 


F 


383395 




3567 


F 


933211 




5162 


B 


776333 


1973 


F 


381536 




3568 


F 


946473 




5163 


B 


778191 


1 Q1A 


r 


383730 




3569 


F 


944568 




5164 


B 


777926 


1975 


F 


381782 




3570 


F 


952562 




5165 


B 


779832 


1976 


F 


384948 




3571 


F 


950664 




5166 


B 


777455 


1977 


F 


383057 




3572 


F 


965649 




5167 


B 


779380 
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SEQID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


1978 


F 


385474 




3573 


F 


963730 




5168 


B 


779476 


1979 


F 


383532 




3574 


F 


968519 




5169 


B 


781342 


1980 


F 


385908 




3575 


F 


966614 




5170 


B 


781774 


i no i 


F 


384008 




3576 


F 


970497 




5171 


B 


783686 


1 QCO 


T? 

r 


386643 




3577 


F 


968601 




5172 


B 


782667 


1 OOQ 


r 


384750 




3578 


F 


971879 




5173 


B 


784562 


1 QQA 


r 


387099 




3579 


F 


970043 




5174 


B 


785748 


1985 


F 


385204 




3580 


F 


972888 




5175 


B 


787658 


1 OO/C 

19oq 


r 


387581 




3581 


F 


970962 




5176 


B 


786222 


1 QO*7 


r 


IOC ZT - 7*7 

385677 




3582 


F 


998162 




5177 


B 


788126 


1 QOO 

1988 


F 


") OOAAA 

388009 




3583 


F 


996241 




5178 


B 


786803 


1 QOO 

iyo9 


i? 
r 


386062 


3584 


F 


1003657 


5179 


B 


788703 


199U 


r 


388927 


3585 


F 


1001756 


5180 


B 


787998 


1991 


F 


387033 


3586 


F 


1009313 


5181 


B 


789876 


1 OQ1 

1992 


F 


389726 


3587 


F 


1007413 


5182 


B 


788279 


1 GOT 

1993 


F 


387821 


3588 


F 


1028954 


5183 


B 


790255 


1 GO /I 

1994 


F 


391295 


3589 


F 


1027039 


5184 


B 


790369 


1 OQC 

1995 


F 


1 O A*> 

389365 


3590 


B 


730 


5185 


B 


792247 


199o 


F 


392171 


3591 


B 


2645 


5186 


B 


790862 


1 00*7 

iyy I 


F 


390291 


3592 


B 


3521 


5187 


B 


792787 


1 QOO 

1998 


F 


393930 


3593 


B 


5431 




5188 


B 


792247 


1 OQG 

1V99 


T? 

F 


a** a 1 >i 

392014 


3594 


B 


5295 




5189 


B 


794137 




r 


395085 


3595 


B 


7188 




5190 


B 


793352 


ZUU 1 


r 


393185 


3596 


B 


6740 




5191 


B 


795215 




r 


1 Af O^l 

395827 




3597 


B 


8652 




5192 


B 


794276 


ZUUJ 


r 


393940 




3598 


B 


8240 




5193 


B 


796196 


2UU4 


F 


396274 




3599 


B 


10138 




5194 


B 


795215 




F 


394423 




3600 


B 


8959 




5195 


B 


797077 


o aa£ 


F 


397156 




3601 


B 


10816 




5196 


B 


795667 


2007 


F 


395216 




3602 


B 


9285 




5197 


B 


797571 


2008 










rs 

D 


11160 




5198 


B 


796515 


2009 


F 


396790 




3604 


B 


9689 




5199 


B 


798382 


2010 


F . 


399550 




3605 


B 


11591 




5200 


B 


797235 


2011 


f : 


397659 




3606 


B . 


L0679 




5201 


B 


799168 



WO 99/28475 



252 



PCT/IB98/01939 



ak,\i ID 


Ur. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


OA1 O 

2012 


T7 

r 


399797 




3607 


B 


1256B 




5202 


B 


797585 


OA 1 0 

2013 


F 


t Ann i c 

397915 




3608 


B 


11515 




5203 


B 


799553 


OA 1 >1 

2014 


F 


401527 




3609 


B 


13376 




5204 


B 


798102 


2015 


17 

F 


399623 




3610 


B 


12136 




5205 


B 


799976 


OA 1 H 

2U16 


T7 

r 


401907 




3611 


B 


14038 




5206 


B 


798306 


om 7 
20 1 / 


T7 

r 


0 AAAiCA 

399960 




3612 


B 


13565 




5207 


B 


800214 


OA1 Q 

20 lo 


T7 

r 


/I AO A 1 "7 

403017 




3 613 


B 


15450 




5208 


B 


798662 


2019 


T7 

r 


401 131 




3614 


B 


14667 




5209 


B 


800546 


OAOA 

2020 


r 


A A*5 All 

403017 




3615 


B 


16535 




5210 


B 


799168 


O AO 1 

2021 


F 


401 I3l 




3616 


B 


15254 




5211 


B 


801074 


OAOO 

2022 


17 

r 


404910 


3617 


B 


17183 


5212 


B 


800159 


ZUZ3 


r 


/I AO A1 A 

403010 


3618 


B 


16189 


5213 


B 


802054 




r 


405 /2o 


3619 


B 


18097 


5214 


B 


801153 


OAO< 

2025 


T7 

r 


403836 


3620 


B 


16790 


5215 


B 


803031 


ZUZO 


TO 

r 


406537 


3621 


B 


18621 


5216 


B 


802363 


OAOO 

2027 


F 


404932 


3622 


B 


16710 


5217 


B 


804257 


OAOO 

2028 


T-> 

r 


410291 


3623 


B 


18587 


5218 


B 


802893 


OAOA 

2029 


F 


408347 


3624 


B 


18458 


5219 


B 


804802 


OA2 A 

2030 


F 


/I 1 1 v( DO 

41 1488 


3625 


B 


20318 


5220 


B 


803466 


oao 1 
ZU3 1 


r 


4095 1 8 


3626 


B 


20768 


5221 


B 


805366 


OAIO 

2032 


T7 

r 


412379 


3627 


B 


22647 


5222 


B 


804440 


OAIO 

2033 


T7 

F 


410487 


3628 


B 


22438 


5223 


B 


806374 


2034 


F 


413164 


3629 


B 


24335 


5224 


B 


805576 


OA1 C 

2035 


T7 

F 


411263 


3630 


B 


22892 




5225 


B 


807523 


ZU30 


r 


413606 




3631 


B 


24752 




5226 


B 


806511 


ZU J / 


r 


>|1 1 iCO/C 

41 1626 




3632 


B 


23046 




5227 


B 


808410 


ZUJo 


r 


A 1 0 70 1 

413 /21 




■5 e? i i 

3633 


B 


24924 




5228 


B 


808305 


OHIO 

203y 


r 


a i 1 ocn 

411859 




3634 


B 


23442 




5229 


B 


810207 


OA>l A 

2040 


17 

F - 


414921 




3635 


B 


25313 




5230 


B 


809229 


2041 


F * 


413049 




3636 


B 


23623 




5231 


B 


B11164 


2042 


X * 


T 1 \jD 1 / 




jOj / 


D 


0 c c o o 




5232 


B 


B10700 


2043 


F - 


414606 




3638 


B 


24822 




5233 


B 


B12581 


2044 


F ' 


417445 




3639 


B < 


26708 




5234 


B 


311825 


2045 


F ' 


415557 




3640 


B . 


26618 




5235 


B 1 


313787 



WO 99/28475 



253 



PCT/IB98/01939 



CIFO IT* 

o&yl IV 


Ur. 


position 




SEQ ID 


Or . 


position 


SEQ ID 


Or. 


position 


ZU40 


p 
r 


A 1 *7Q1 O 

41 /5lz 




3641 


B 


28544 


5236 


B 


812574 


9047 


r 


4ioyiz 




T £" A 1 

3642 


B 


27972 


5237 


B 


814467 


ZU*ro 


p 
r 


/1 1 CI 01 




T C A *3 

3643 


B 


29859 


5238 


B 


813787 


90AQ 


p 
r 


41 A^l 7 

410j1 / 




3b44 


B 


T ft ft ft ft 

29000 


5239 


B 


815676 


90^0 


p 
r 


A 1 A/1 CJ 

419433 




364b 




30908 


5240 


B 


814082 


90^ i 

ZIO 1 


p 

r 


A 1 *7/l /IT 

41 /44 / 




3646 


B 


30242 


5241 


B 


815892 


ZUDZ 


p 
r 


420026 




3647 


B 


32160 


5242 


B 


814571 


90^1 


p 


/| 1 OA/CO 

4180oo 




3648 


B 


30966 


5243 


B 


816454 


90^4 


p 
r 


/IT 1 1 1 A 
4Z1Z /0 




T A ft 

364 9 


B 


32849 


5244 


B 


815441 


90^ 


p 


4iyjyz 




T ^ C ft 

3bbU 


B 


3 1516 


5245 


B 


817347 


90^6 

ZuOO 


p 
r 


4ZZZ0Z 




3651 


B 


33401 


5246 


B 


815101 


9fK7 

ZUO / 


p 
r 


4Z0364 


3652 


B 


32032 


5247 


B 


817025 


90^S 


p 
r 


423007 


3653 


B 


33900 


5248 


B 


815953 


90^0 

zuoy 


p 
r 


421 107 


3654 


B 


33576 


5249 


B 


817887 


9060 


p 
r 


/!T/I 01/1 

4Z4K34 


T r r r 

3655 


B 


35496 


5250 


B 


817709 


9061 


p 
r 


4Z3U1K 


3 656 


B 


34121 


5251 


B 


819618 


2069 


p 


4Z0UZ0 


t c *7 
3 bb / 


B 


36052 


5252 


B 


819559 


9061 


p 
r 


4Z41 10 


1 c c o 

3 boo 


B 


35251 


5253 


B 


821442 




p 
r 


4Zooo3 


T £T C ft 

3659 


B 


37164 


5254 


B 


819933 


906^ 
ZUO J 


p 
r 


/I OCA/1 A 

4Z>040 


3660 


B 


35588 


5255 


B 


821846 


ZuOD 


p 
r 


429303 


3661 


B 


37502 


5256 


B 


820622 


9067 


p 
r 


/I T*7/1 "3 O 

4Z /43o 


3662 


B 


36152 


5257 


B 


822548 


2068 


p 
r 


47QAA.Q 

4zyouy 




Job J 


B 


38041 


5258 


B 


821281 


9060 


p 
r 


HZ/ /J f 




3664 


B 


37216 


5259 


B 


823181 


9070 
/ u 


p 
r 


A1 A 1 A/1 

430104 




3665 


B 


39127 


5260 


B 


821778 


9071 


p 
r 


^zoiy i 




3666 


B 


38041 


5261 


B 


823666 


9079 
ZU /Z 


r 


/1 1 n^Ti 

430622 




3667 


B 


39980 


5262 


B 


822494 


9071 


p 
r 


4100 1 i< 

425814 




3668 


B 


39322 


5263 


B 


824387 


9074 


p 

r < 


4*3 1 /IT A 

*3 1439 




3669 


B 


41205 


5264 


B 


823155 


907<; 

ZU / J 


p 


soften 
W9537 




3670 


B 


40161 


5265 


B 


B25057 


2076 


F ' 






JO / X 


n 
a 


A ft ft *-J A 

1Z U / 4 


5266 


B 1 


B24589 


2077 


F * 


130108 




3672 


B - 


40985 


5267 


B ( 


326509 


2078 


F < 


134401 




3673 


B - 


52887 


5268 


B t 


326122 


2079 


F < 


02485 




3674 


B - 


12284 


5269 


B I 


328041 



WO 99/28475 



254 



PCT/IB98/01939 



CPA TT\ 

aiLvj 113 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2080 


r 


A "5 /AfA 

436050 




3675 


B 


44180 




5270 


B 


826572 


2081 


F 


434134 




3676 


B 


43337 




5271 


B 


828500 


2082 


F 


436342 




3677 


B 


45275 




5272 


B 


827246 


2083 


F 


434428 




3678 


B 


44022 




5273 


B 


829208 


2084 


F 


437672 




3679 


B 


45927 




5274 


B 


827489 


2Uoj 


r? 
T 


435775 




3680 


B 


45451 




5275 


B 


829407 


2086 


F 


A 1 O AC f 

438051 ■ 




3681 


B 


47375 




5276 


B 


828274 


TOOT 

208/ 


F 


436165 




3682 


B 


46308 




5277 


B 


830176 


2088 


F 


438767 




3683 


B 


48208 




5278 


B 


829495 


i Ann 

2089 


F 


436866 




3684 


B 


46560 


.... 


5279 


B 


831395 


2090 


F 


439139 




3685 


B 


48485 




5280 


B 


831192 


20S 1 


F 


437145 




3686 


B 


47115 




5281 


B 


833082 


20y2 


r 


439479 




3687 


B 


49053 




5282 


B 


832005 


t aai 


b 


437574 




3688 


B 


48208 




5283 


B 


833870 


AAA/1 

2094 


F 


440823 




3689 


B 


50118 




5284 


B 


832373 


2095 


F 


438923 




3690 


B 


48806 




5285 


B 


834280 


2096 


F 


441668 




3691 


B 


50708 




5286 


B 


832964 


AAA»J 

2097 


F 


439746 




3692 


B 


50333 




5287 


B 


834864 


2098 


F 


444271 




3693 


B 


52220 




5288 


B 


833672 


2099 


r 


442371 




3694 


B 


50960 




5289 


B 


835604 


1 1 AA 

2100 


F 


446233 




3695 


B 


52890 




5290 


B 


834022 


O 1 A1 

2101 


F 


444302 




3696 


B 


52660 




5291 


B 


835945 


1 1 A1 

2102 


F 


447687 




3697 


B 


54606 




5292 


B 


834907 


T 1 A1 

2103 


F 


445803 




3698 


B 


53737 




5293 


B 


836803 


2104 


F 


450318 




3699 


B 


55645 




5294 


B 


835683 


1 1 AC 

2103 


F 


448399 




3700 


B 


54793 




5295 


B 


837593 


11 A£ 

2100 


F 


450876 




3701 


B 


56691 




5296 


B 


836018 


1 1 A*7 

2107 


F 


449025 




3702 


B 


55329 




5297 


B 


837902 


1 1 AO 

2108 


F 


451274 




3703 


B 


57226 




5298 


B 


836714 


2109 


F 


449397 




3704 


B 


56691 




5299 


B 


838602 




r 






*5 H A C 

J / Ub 


B 


58566 




5300 


B 


837484 


2111 


F 


450513 




3706 


B 


56317 




5301 


B 


839424 


2112 


F 


453303 




3707 


B 


58226 




5302 


B 


838950 


2113 


F 


451427 




3708 


b 


58469 




5303 


B 


840832 



WO 99/28475 



255 



PCT7IB98/01939 



SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2114 


F 


454713 




3709 


B 


60363 




5304 


B 


839800 


2115 


F 


452853 




3710 


B 


59406 




5305 


B 


841700 


2116 


F 


455096 




3711 


B 


61301 




5306 


B 


840808 


2117 


F 


453248 




3712 


B 


60450 




5307 


B 


842715 


2118 


F 


455808 




3713 


B 


62316 




5308 


B 


841975 


2119 


F 


453928 




3714 


B 


61722 




5309 


B 


843834 


2120 


F 


457440 




3715 


B 


63603 




5310 


B 


842679 


2121 


F 


455488 




3716 


B 


62585 




5311 


B 


844614 


2122 


F 


458282 




3717 


B 


64473 




5312* 


• B 


843342 


2123 


F 


456354 




3718 


B 


63362 


5313 


B 


845247 


2124 


F 


459558 


3719 


B 


65285 


5314 


B 


843736 


2125 


F 


457686 


3720 


B 


64203 


5315 


B 


845626 


2126 


F 


460960 


3721 • 


B 


66103 


5316 


B 


846423 


2127 


F 


459060 


3722 


B 


64850 


5317 


B 


848330 


2128 


F 


461659 


3723 


B 


66749 


5318 


B 


844423 


2129 


F 


459758 


3724 


B 


64899 


5319 


B 


846258 


2130 


F 


462674 


3725 


B 


66776 


5320 


B 


848265 


2131 


F 


460775 


3726 


B 


65164 


5321 


B 


850174 


2132 


F 


463788 


3727 


B 


67067 


5322 


B 


850343 


2133 


F 


461895 


3728 


B 


67514 


5323 


B 


852246 


2134 


F 


464479 


3729 


B 


69390 


5324 


B 


851875 


2I35 


F 


462602 


3730 


B 


69097 


5325 


B 


853765 


2136 


F 


465882 


3731 


B 


71033 


5326 


B 


853944 


2137 


F 


463989 


3732 


B 


69470 


5327 


B 


855822 


2138 


F 


467200 


3733 


B 


71354 




5328 


B 


855056 


2139 


F 


465300 


3734 


B 


69961 




5329 


B 


856964 


2140 


F 


468680 




3735 


B 


71898 




5330 


B 


855750 


2141 


F 


466787 




3736 


B 


70707 




5331 


B 


857714 


2142 


F 


469130 




3737 


B 


72579 




5332 


B 


856488 


2143 


F 


467224 




3738 


B 


71249 




5333 


B 


858370 




r 


4oy o / l 




niQ 

j / jy 


B 


73134 




5334 


B 


857403 


2145 


F 


467707 




3740 


B 


73458 




5335 


B 


B59303 


2146 


F 


470887 




3741 


B 


75376 




5336 


B 


B59659 


2147 


F - 


468984 




3742 


B 


75121 




5337 


B 


B61570 



WO 99/28475 



256 



PCT/IB98/01939 



ISLQ ID 


Or, 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or, 


position 


1 A O 

2148 


F 


471590 




3743 


B 


77020 




5338 


B 


860166 


2149 


F 


469835 




3744 


B 


75741 




5339 


B 


862082 


2150 


F 


473033 




3745 


6 


77608 




5340 


B 


860879 


01 < i 


F 


471133 




3746 


£ 


77447 




5341 


B 


862767 


o i *o 
Zl jz 


r 


473761 




3747 


B 


79300 




5342 


B 


861861 


01 <l 
Z 1 j j 


T? 

r 


471861 




3748 


B 


77535 




5343 


B 


863727 


2134 




A1 At Ol 

474383 




3749 


B 


79430 




5344 


B 


862284 


2135 


F 


472478 




3750 


B 


78333 




5345 


B 


864173 


2 1 jo 


r 


475333 


3751 


B 


80243 




5346 


B 


863792 


21 j / 


F 


AT> A /*f 

473465 


3752 


B 


79300 


5347 


B 


865660 


Z1S8 


F 


476279 


3753 


B 


81197 


5348 


B 


864164 


0 1 CO 


r 


474390 


3754 


B 


82201 


5349 


B 


866050 


210U 


T? 

r 


y| *70 yt yl 

478446 


3755 


B 


84100 


5350 


B 


864925 


o i £ i 
2101 


r? 

F 


A/6546 


3756 


B 


83493 


5351 


B 


866860 


0 1 /CO 

2102 


F 


478869 


3757 


B 


85405 


5352 


B 


866468 


O 1 /CO 

zlOi 


F 


476918 


3758 


B 


85075 


5353 


B 


868368 


2104 


F 


479441 


3759 


B 


86973 


5354 


B 


866860 


01 £C 


F 


477548 


3760 


B 


87141 


5355 


B 


868734 


01 « 
Z100 


F 


479676 


3761 


B 


89086 


5356 


B 


867952 


oi £n 

210/ 


r 


477775 


3762 


B 


88903 


5357 


B 


869878 


01 AC 


T? 

r 


481277 


3763 


B 


90810 


5358 


B 


869311 


01 AO 

zioy 


c 
r 


479377 


3764 


B 


88038 


5359 


B 


871226 


Zl /U 


r 


481635 




3765 


B 


89946 


5360 


B 


869582 


0171 
Zl / 1 


u 
r 


479745 




3766 


B 


88981 


5361 


B 


871517 


0 1 70 
Zl /Z 


T7 

r 


483172 




3767 


B 


90884 




5362 


B 


871614 


0171 
Z 1 / J 


r 


A0 1 onft 

481279 




3768 


B 


89485 




5363 


B 


873536 


0 1 7/1 
Z 1 /4 


c 
r 


484659 




3769 


B 


91385 




5364 


B 


872804 


017^ 
Zl / J 


T? 

r 


482764 




3770 


B 


89975 




5365 


B 


874702 


O 1 7/C 

21/0 


T? 

r 


A Of Aft^ 

485003 




3771 


B 


91855 




5366 


B 


873093 


O 1 *7T 

2177 


F 


483097 




3772 


B 


91680 




5367 


B 


874980 


2178 


r 


dR7Q4£ 




J / / J 


■Q 

D 


Q "3 C C —1 




5368 


B 


B74158 


2179 


F - 


486083 




3774 


B 


92008 




5369 


B 


B76056 


2180 


F - 


487946 




3775 


B 


93902 




5370 


B 


B74033 


2181 


F - 


486093 




3776 


B 


92633 




5371 


B 


B75932 



WO 99/28475 



257 



PCT/IB98/0I939 



SEQ ID 


Or. 


positi n 




SEQ ID 


Or, 


position 




SEQ ID 


Or. 


position 


2182 


F 


489220 




3777 


B 


94569 




5372 


B 


874670 


2183 


F 


487274 




3778 


B 


93390 




5373 


B 


876580 


2184 


F 


490276 




3779 


B 


95288 




5374 


B 


875606 


2185 


F 


488388 




3780 


B 


94354 




5375 


B 


877506 


2186 


F 


492138 




3781 


B 


96254 




5376 


B 


876619 


2187 


F 


490229 




3782 


B 


94897 




5377 


B 


878519 


2188 


F 


492475 




3783 


B 


96772 




5378 


B 


878160 


2189 


F 


490618 „ 


3784 


B 


98519 




5379 


B 


880034 


2190 


F 


493591 


3785 


B 


100439 


5380 


B 


878429 


2191 


F 


491719 


3786 


B 


98962 


5381 


B 


880292 


2192 


F 


494297 


3787 


B 


100853 


5382 


B 


879336 


2193 


F 


492436 


3788 


B 


100451 


5383 


B 


881230 


2194 


F 


494530 


3789 


B 


102387 


5384 


B 


879393 


2195 


F 


492679 


3790 


B 


101639 


5385 


B 


881281 


2196 


F 


494637 


3791 


B 


103473 


5386 


B 


880089 


2197 


F 


492753 


3792 


B 


102457 


5387 


B 


881994 


2198 


F 


495467 


3793 


B 


104357 


5388 


B 


881108 


2199 


F 


493559 


3794 


B 


102745 


5389 


B 


883062 


2200 


F 


496076 


3795 


B 


104666 


5390 


B 


882094 


2201 


F 


494126 


3796 


B 


104544 


5391 


B 


884013 


2202 


F 


497468 


3797 


B 


106464 


5392 


B 


883124 


2203 


F 


495569 


3798 


B 


105338 


5393 


B 


885027 


2204 


F 


498050 


3799 


B 


107237 


5394 


B 


884014 


2205 


F 


496160 


3800 


B 


106127 


5395 


B 


885924 


2206 


F 


498994 


3801 


B 


108014 


5396 


B 


884530 


2207 


F 


497096 


3802 


B 


108693 




5397 


B 


886432 


2208 


F 


500571 




3803 


B 


110587 




5398 


B 


885226 


2209 


F 


498671 




3804 


B 


109189 




5399 


B 


887139 


2210 


F 


501547 




3805 


B 


111079 




5400 


B 


889010 


2211 


F 


499671 




3806 


B 


109684 




5401 


B 


890900 


2212 


F 


502469 




3807 


B 


111580 




5402 


B 


890421 


2213 


F 


500572 




3808 


B 


110587 




5403 


B 


892310 


2214 


F 


503435 




3809 


B 


112486 




5404 


B 


891438 


2215 


F 


501547 




3810 


B 


112740 




5405 


B 


893287 



- WO 99/28475 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or, 


position 


1 a 
zzlo 


F 


505469 




3811 


B 


114673 




5406 


B 


891703 


OO t "7 

221 / 


F 


C AO C A C 

503545 




3812 


B 


113684 




5407 


B 


893606 


22 lo 


r 


506768 




3813 


B 


115653 




5408 


B 


893606 


2219 


F 


504880 




3814 


B 


114216 




5409 


B 


895490 


2220 


F 


507356 




3815 


B 


116158 




5410 


B 


894049 


2221 


F 


505530 




3816 


B 


114836 




5411 


B 


896024 


2.2.1L 


r 


508015 




3817 


B 


116732 




5412 


B 


894139 


OOOl 

ZZZ;S 


r 


506157 




3818 


B 


115473 




5413 


B 


896074 


oooyi 
2224 


r 


C AO j* *7 

508247 




3819 


B 


117380 




5414 


B 


894545 


2225 


r 


506351 




3820 


B 


115898 




5415 


* B 


896413 


2226 


F 


509270 


3821 


B 


117797 


5416 


B 


894999 


2227 


F 


507356 


3822 


B 


120031 


5417 


B 


896912 


2228 


F 


510759 


3823 


B 


121926 


5418 


B 


896127 


222V 


F 


508918 


3824 


B 


124231 


5419 


B 


898012 


OOIA. 
ZZJU 


r 


51 1268 


3825 


B 


126158 


5420 


B 


897049 


2xo 1 


r 


CAAO ce\ 

509359 


3826 


B 


125215 


5421 


B 


898949 


ZZjZ 


T7 

r 


512124 


3827 


B 


127115 


5422 


B 


901018 


2233 


F 


51 0202 


3828 


B 


125352 


5423 


B 


902955 


2234 


F 


512836 


3829 


B 


127271 


5424 


B 


902393 


ioi c 
223D 


F 


510926 


3830 


B 


126492 


5425 


B 


904301 


2z3o 


T 

r 


514569 


3831 


B 


128390 


5426 


B 


904098 


0017 
ZZ J I 


T? 

r 


512663 


3832 


B 


127150 


5427 


B 


906002 


001ft 
ZZJo 


T? 

r 


C 1 A COG 

514688 


3833 


B 


129050 


5428 


B 


903951 


0010 


r 


512874 




3834 


B 


128010 




5429 


B 


905851 


OOyl A 

224U 


F 


515334 




3835 


B 


129905 




5430 


B 


905825 


O0>1 1 

2241 


F 


513415 




3836 


B 


129947 




5431 


B 


907725 


OO/l 0 

2242 


F 


515516 




3837 


B 


131873 




5432 


B 


906700 


ZZ4j 


F 


513670 




3838 


B 


131255 




5433 


B 


908669 


OOvl /I 

2244 


F 


516230 




3839 


B 


133205 




5434 


B 


907174 


OO/t c 

224j 


T? 

F , 


514348 




3840 


B 


131873 




5435 


B 


909066 


2246 




5171 81 

Jit 101 




jOtl 


n 

a 


ij j / *± y 




5436 


B 


907579 


2247 


F i 


515261 




3842 


B 


133381 




5437 


B 


909480 


2248 


F i 


517840 




3843 


B 


135305 




5438 


B 


908962 


2249 


F '. 


515977 




3844 


B 


133804 




5439 


B 


910922 



WO 99/28475 
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PCT/IB98/01939 



SEQID 


Or, 


position 


2250 


F 


518184 


2251 


F 


516283 


2252 


F 


518704 


2253 


F 


516804 


2254 


F 


519358 


2255 


F 


517404 


2256 


F 


520483 


2257 


F 


518598 


2258 


F 


521632 


2259 


F 


519765 


2260 


F 


521919 


2261 


F 


520024 


2262 


F 


523372 


2263 


F 


521495 


2264 


F 


524404 


2265 


F 


522508 


2266 


F 


525498 


2267 


F 


523600 


2268 


F 


525687 


2269 


F 


523753 


2270 


F 


526036 


2271 


F 


524164 


2272 


F 


526632 


2273 


F 


524732 


2274 


F 


528073 


2275 


F 


526158 


2276 


F 


528835 


2277 


F 


526999 


2278 


F 


529393 


2279 


F 


527504 


2280 


F 


530004 


2281 


F 


528082 


2282 


F 


530518 


2283 


F 


528594 



SEQ ID 


Or. 


position 


3845 


B 


135704 


3846 


B 


134219 


3847 


B 


136133 


3848 


B 


134729 


3849 


B 


136633 


3850 


B 


137694 


3851 


B 


139583 


3852 


B 


140496 


3853 


B 


142396 


3854 


B 


140953 


3855 


B 


142856 


3856 


B 


142031 


3857 


B 


143950 


3858 


B 


143520 


3859 


B 


145425 


3860 


B 


144066 


3861 


B 


145967 


3862 


B 


144629 


3863 


B 


146519 


3664 


B 


146547 


3865 


B 


148446 


3866 


B 


147304 


3867 


B 


149227 


3868 


B 


148296 


3869 


B 


150255 


3870 


B 


148769 


3871 


B 


150670 


3872 


B 


149320 


3873 


B 


151168 


3874 


B 


149532 


3875 


B 


151470 


3876 


B 


149934 


3877 


B 


151845 


3878 


B 


151168 



SEQ ID 


Or. 


position 


5440 


B 


909415 


5441 


B 


911315 


5442 


B 


909804 


5443 


B 


911704 


5444 


B 


910218 


5445 


B 


912075 


5446 


B 


910834 


5447 


B 


912689 


5448 


B 


911225 


5449 


B 


913064 


5450 


B 


912492 


5451 


B 


914381 


5452 


B 


912852 


5453 


B 


914763 


5454 


B 


913438 


5455 


B 


915340 


5456 


B 


913901 


5457 


B 


915814 


5458 


B 


914730 


5459 


B 


916694 


5460 


B 


915356 


5461 


B 


917256 


5462 


B 


915717 


5463 


B 


917608 


5464 


B 


916854 


5465 


B 


918761 


5466 


B 


916762 


5467 


B 


918712 


5468 


B 


917848 


5469 


B 


919756 


5470 . 


B 


918588 


5471 


B 


920424 


5472 


B 


918875 


5473 


B 


920796 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2284 


F 


532100 




3879 


B 


153047 




5474 


B 


919756 


228S 


F 


530224 




3880 


B 


152275 




5475 


B 


921710 


2286 


F 


533025 




3881 


B 


154165 




5476 


B 


920055 


2287 


F 


531125 




3882 


B 


152887 




5477 


B 


921949 


2288 


F 


533710 




3883 


B 


154804 




5478 


B 


920389 


2289 


F 


531825 




3884 


B 


153600 




5479 


B 


922328 


2290 


F 


534658 




3885 


B 


155482 




5480 


B 


921130 


2291 


F 


532758 




3886 


B 


154217 




5481 


B 


922978 


2292 


F 


535737 




3887 


B 


156157 




5482 


B 


921517 


2293 


F 


533828 




3888 


B 


155523 


5483 


B 


923414 


2294 


F 


539456 


3889 


B 


157396 


5484 


B 


921740 


2295 


F 


537568 


3890 


B 


156823 


5485 


B 


923646 


2296 


F 


540290 


3891 


B 


158731 


5486 


B 


921979 


2297 


F 


538375 


3892 


B 


157745 


5487 


B 


923926 


2298 


F 


540672 


3893 


B 


159645 


5488 


B 


922396 


2299 


F 


538777 


3894 


B 


159207 


5489 


B 


924327 


2300 


F 


541573 


3895 


B 


161084 


5490 


B 


922729 


2301 


F 


539706 


3896 


B 


160216 


5491 


B 


924611 


2302 


F 


542999 


3897 


B 


162127 


5492 


B 


923256 


2303 


F 


541102 


3898 


B 


161451 


5493 


B 


925216 


2304 


F 


543922 


3899 


B 


163350 


5494 


B 


923673 


2305 


F 


542057 


3900 


B 


161818 


5495 


B 


925589 


2306 


F 


544268 


3901 


B 


163726 


5496 


B 


924297 


2307 


F 


542354 


3902 


B 


162601 


5497 


B 


926176 


2308 


F 


544691 




3903 


B 


164501 


5498 


B 


924645 


2309 


F 


542785 




3904 


B 


163282 




5499 


B 


926555 


2310 


F 


546323 


3905 


B 


165205 




5500 


B 


925216 


23 ll 


F 


544441 


3906 


B 


164171 




5501 


B 


927117 


2312 


F 


546467 




3907 


B 


166082 




5502 


B 


925589 


2313 


F 


544650 




3908 


B 


165812 




5503 


B 


927489 


^314 


F 


547257 




3909 


B 


167672 




5504 


B 


926980 


2315 


F 


545357 




3910 


B 


166939 




5505 


B 


928903 


2316 


F 


547658 




3911 


B 


168861 




5506 


B 


927408 


2317 


F 


545781 




3912 


B 


168421 




5507 


B 


929331 
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OM*j\2 ID 


Or. 


position 




SEQ ID 


Or. 


positi n 


SEQ ID 


Or. 


position 


071 c 


r 


548406 




3913 


B 


170322 


5508 


B 


927824 


071 Q 


r 


546466 




3914 


B 


169503 


5509 . 


B 


929727 


07on 


1? 

r 


549443 




O ft 1 C 

3915 


B 


171369 


5510 


B 


928418 


070 1 
Oil 


r 


547527 




3916 


B 


170715 


5511 


B 


930287 




r 


55U193 




3917 


B 


172587 


5512 


B 


928979 


0707 


17 

r 


548288 




3918 


B 


171554 


5513 


B 


930879 


070,4 


t7 

r 


551554 




3919 


B 


173512 


5514 


B 


930474 


070^ 


c 
r 


549665 




3920 


B 


172596 


5515 


B 


932367 


070A 


T7 

r 


552877 




3921 


B 


174496 


5516 


B 


930879 


0707 


T7 

r 


550963 




3922 


B 


173021 


5517 


B 


932776 


0100 
Zj28 


17 

r 


554708 




3923 


B 


174847 


5518 


B 


931298 


070Q 


T7 

r 


552807 


3924 


B 


174716 


5519 


B 


933223 


077A 


17 

r 


556293 


3925 


B 


176619 


5520 


B 


931886 


077 1 


T7 

r 


554381 


3926 


B 


175205 


5521 


B 


933812 


0770 


17 

r 


557095 


3927 


B 


177143 


5522 


B 


932187 


0777 


17 

r 


555164 


3928 


B 


175943 


5523 


B 


934117 


0774 


17 

r 


557522 


3929 


B 


177830 


5524 


B 


932803 


077^ 


17 

r 


c c c £. e e 

555656 


3930 


B 


177244 


5525 


B 


934733 


077£ 


r 


558021 


3931 


B 


179180 


5526 


B 


933421 


0777 


r 


556191 


3932 


B 


178337 


5527 


B 


935352 




077fi 


T7 


558964 


3933 


B 


180239 


5528 


B 


933841 




077Q 


r 


557076 


3934 


B 


180274 


5529 


B 


935758 






i? 
r 


560913 




3935 


B 


182181 


5530 


B 


934062 




0741 


T? 

r 


559007 




3936 


B 


180972 


5531 


B 


935933 




0740 


17 

r 


562832 




3937 


B 


182864 


5532 


B 


934794 




0747 


r 


561002 




3938 


B 


181659 


5533 


B 


936682 






17 

r 


563374 




3939 


B 


183572 


5534 


B 


936606 




074^ 


17 

r 


561474 




3940 


B 


182864 


5535 


B 


938521 




074A 


T7 

r 


565061 




3941 


B 


184750 


5536 


B 


938373 




0747 


T? J 

r : 


563198 




3942 


B 


183876 


5537 


B 


940324 




2348 


r 


s6AR4H 




7 OA 1 


B 


185778 


5538 


B 


939321 




2349 


F i 


564908 




3944 


b ; 


185007 


5539 


B ! 


941229 




2350 


F < 


567802 




3945 


B : 


L86910 


5540 


B < 


941153 




2351 


F ; 


56S896 




3946 


b : 


L86246 


5541 


B < 


543053 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2352 


F 


571045 




3947 


B 


188160 




5542 


B 


942291 


2353 


F 


569137 




3948 


B 


187080 




5543 


B 


944212 


2354 


F 


572300 




3949 


B 


189016 




5544 


B 


944964 


2355 


F 


570362 




3950 


B 


187490 




5545 


B 


946809 


2356 


F 


572512 




3951 


B 


189421 




5546 


B 


945527 


2357 


F 


570612 




3952 


B 


188044 




5547 


B 


947426 


2358 


F 


572627 




3953 


B 


189929 




5548 


B 


946546 


2359 


F 


570753 




3954 


B 


189711 




5549 


B 


948430 


OKA 


h 


573416 




3955 


B 


191656 


5550 


B 


946896 


2301 


F 


571539 




3956 


B 


190594 


5551 


B 


948823 


2362 


F 


573964 


3957 


B 


192533 


5552 


B 


948677 


2363 


F 


572065 


3958 


F 


190731 


5553 


B 


950581 


2364 


F 


574221 


3959 


B 


192679 


5554 


B 


949505 


2365 


F 


572303 


3960 


B 


192027 


5555 


B 


951398 


2366 


F 


574704 


3961 


B 


193906 


5556 


B 


949834 


230/ 


F 


572781 


3962 


B 


192684 


5557 


B 


951743 


23oo 


F 


574795 


3963 


B 


194643 


5558 


B 


950897 


2369 


F 


572936 


3964 


B 


193419 


5559 


B 


952796 


2370 


F 


575229 


3965 


B 


195348 


5560 


B 


951550 


2371 


F 


573350 


3966 


B 


194878 


5561 


B 


953534 


**** 1 *70 

2372 


F 


576318 


3967 


B 


196750 


5562 


B 


951870 


2373 


F 


574384 


3968 


B 


195270 


5563 


B 


953763 




r 


577796 




3969 


B 


197166 




5564 


B 


953037 


Id i j 


r 


575893 




3970 


B 


197255 




5565 


B 


954930 


23 /0 


F 


578774 




3971 


B 


199165 




5566 


B 


954509 


23 // 


F 


576870 




3972 


B 


197859 




5567 


B 


956384 


2378 


F 


581976 




3973 


B 


199756 




5568 


B 


955397 


T17rt 

2379 


F 


580073 




3974 


B 


198443 




5569 


B 


957278 


ii on 

2380 


F 


582589 




3975 


B 


200328 




5570 


B 


958191 


2381 


F 


580679 




3976 


B 


199202 




5571 


B 


960060 


2382 


F 






/ / 


D 


OAT f\Q A 




5572 


B 


959570 


2383 


F 


581060 




3978 


B 


200198 




5573 


B 


961432 


2384 


F . 


583210 




3979 


B , 


202084 




5574 


B 


961273 


2385 


F . 


581316 




3980 


b : 


200547 




5575 


B 


963183 



- WO 99/28475 
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acjyi LU 


Ur. 


position 




SEQ ZD 


Or . 


position 




SEQ ID 


Or. 


position 


23 00 


r 


COi C£A 

583554 




"5 ft O 1 

3981 


B 


202447 




5576 


B 


961837 


OO 01 

2387 


F 


COI /TOT 

581697 




39B2 


£ 


n f\ a fs. +s a 

200934 




5577 


B 


963749 


2388 


F 


584048 




3983 


B 


202806 




5578 


B 


962823 


OO Oft 

2389 


F 


582179 




3984 


B 


201224 




5579 


B 


964695 


o*^ c\n 

2390 


F 


584797 




3985 


B 


203115 




5580 


B 


962199 


2391 


T? 

r 


582897 




3986 


B 


202606 




5581 


B 


964099 


oo. oo 
2392 


r 


584978 




3987 


B 


204583 




5582 


B 


965286 


2393 


r 


C OO ft*) 1 

583031 




3988 


B 


203735 




5583 


B 


967118 


Olft/1 

2394 


r 


586256 




3989 


B 


205603 




5584 


B 


965928 


2395 


T7 

r 


584320 




3990 


B 


204218 




5585 


B 


967828 


2396 


F 


587527 




3991 


B 


206147 


5586 


B 


967261 


239/ 


r? 

r 


585631 


3992 


B 


206686 


5587 


B 


969179 


239o 


rr 

r 


COO 

588892 


3993 


B 


208569 


5588 


B 


968345 




c 
r 


CQOACO 

587052 


3994 


B 


207672 


5589 


B 


970181 


Z4UU 


■n 
r 


com ^o 

589362 


3995 


B 


209558 


5590 


B 


969123 


24U1 


T7 

r 


COO A CI 

587452 


3996 


B 


208799 


5591 


B 


971023 


o/t no 
24U2 


r 


CftftC 1 /" 

590516 


3997 


B 


210648 


5592 


B 


970973 


O/l AO 

24U3 


F 


588616 


3998 


B 


209701 


5593 


B 


972849 


0/1 A/1 

24U4 


F 


592168 


3999 


B 


211599 


5594 


B 


972081 


24U5 


F 


590235 


4000 


B 


210500 


5595 


B 


973933 


O/l A< 

24UO 


r? 

r 


592629 


4001 


B 


212364 


5596 


B 


973279 


Oyl AO 

24U/ 


F 


590708 


4002 


B 


211064 




5597 


B 


975173 


24Uo 


T? 

r 


CftOAA 1 

592901 


4003 


B 


212957 




5598 


B 


974803 


Z4Uy 


T? 

r 


590973 




4004 


B 


211557 




5599 


B 


976683 


O/i i n 
24 1U 


r 


593863 




4005 


B 


213453 




5600 


B 


975459 


Z41 1 


r 


Cft 1 Aftft 

591990 




4006 


B 


213248 




5601 


B 


977346 


oa i o 

x4lz 


r 


CAC >f 1 ^ 

595416 




4007 


B 


215101 




5602 


B 


976431 


Z41 J 


r? 

r 


cno AO A 

593484 




4008 


B 


214372 




5603 


B 


978335 


0/11 /I 
Z414 


F 


595935 




4009 


B 


216286 




5604 


B 


976740 


O/I 1 c 
2413 


F 


593962 




4010 


B 


214931 




5605 


B 


978640 


2416 


F 


J7» UZ J 




4m i 

1U1I 




01 C Q Q n 

c -Lb b o / 




5606 


B 


977175 


2417 


F . 


595158 




4012 


B 


215375 




5607 


B 


979042 


2418 


F . 


598254 




4013 


B , 


217268 




5608 


B 


977855 


. 2419 


F i 


596402 




4014 


B . 


216928 




5609 


B 


979768 



WO 99/28475 
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SEQID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2420 


F 


600515 




4015 


B 


218828 


5610 


B 


978153 


2421 


F 


598575 




4016 


B 


218137 


5611 


B 


980060 


2422 


F 


601691 




4017 


B 


220037 


5612 


B 


$78655 


2423 


F 


599745 


4018 


B 


217661 


5613 


B 


980553 


2424 


F 


603291 


4019 


B 


219561 


5614 


B 


979204 


2425 


F 


601376 


4020 


B 


218937 


5615 


B 


981104 


2426 


F 


603538 


4021 


B 


220822 


5616 


B 


979554 


2427 


F 


601638 


4022 


B 


219229 


5617 


B 


981465 


2428 


F 


604263 


4023 


B 


221130 


5618 


B 


981423 


2429 


F 


602388 


4024 


B 


220005 


5619 


B 


983319 


2430 


F 


605359 


4025 


B 


221868 


5620 


B 


980363 


2431 


F 


603425 


4026 


B 


220654 


5621 


B 


982289 


2432 


F 


606622 


4027 


B 


222560 


5622 


B 


982361 


2433 


F 


604721 


4028 


B 


221294 


5623 


B 


984236 


2434 


F 


607722 


4029 


B 


223195 


5624 


B 


983818 


2435 


F 


605908 


4030 


B 


221762 


5625 


B 


985718 


2436 


F 


609039 


4031 


B 


223673 


5626 


B 


984720 


2437 


F 


607143 


4032 


B 


222338 


5627 


B 


986608 


2438 


F 


609842 


4033 


B 


224212 


5628 


B 


985607 


2439 


F 


607958 


4034 


B 


223510 


5629 


B 


987553 


2440 


F 


611565 


4035 


B 


225417 


5630 


B 


986323 


2441 


F 


609657 


4036 


B 


224094 


5631 


B 


988223 


2442 


F 


612719 


4037 


B 


226033 


5632 


B 


986925 


2443 


F 


610819 


4038 


B 


225417 


5633 


B 


988825 


2444 


F 


613473 


4039 


B 


227291 


5634 


B 


987850 


2445 


F 


611565 


4040 


B 


226033 


5635 


B 


989749 


2446 


F 


614096 


4041 


B 


227907 


5636 


B 


988354 


2447 


F 


612172 


4042 


B 


225811 


5637 


B 


990252 


2448 


F 


615072 


4043 


B 


227671 


5638 


B 


988474 


2449 


F 


613207 


4044 


B 


226947 




5639 


B 


990393 


2450 


F 


615645 




4045 


B 


228871 




5640 


B 


989437 


2451 


F 


613745 




4046 


B 


227523 




5641 


B 


991305 


2452 


F 


617266 




4047 


B 


229432 




5642 


B 


993662 


2453 


F 


615347 




4048 


B 


227907 




5643 


B 


995596 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2454 


F 


618142 




4049 


B 


229784 




5644 


B 


994015 


2455 


F 


616242 




4050 


B 


228692 




5645 


B 


995906 


Ovt C/I 

2456 


F 


619019 




4051 


B 


230578 




5646 


B 


994084 


O/l C*7 

2457 


F 


617128 




4052 


B 


230253 




5647 


B 


995955 


O/l CO 

2458 


T? 

r 


619637 




4053 


B 


232157 




5648 


B 


994523 


O/l CO 

2459 


r 


617695 




4054 


B 


231299 




5649 


B 


996382 


2460 


F 


620182 




4055 


B 


233194 




5650 


B 


995903 


2461 


F 


618257 




4056 


B 


233226 




5651 


B 


997791 


2462 


F 


620929 




4057 


B 


235130 




5652 


B 


996831 


O/l /CO 

2463 


F 


619019 




4058 


B 


234073 




5653 


B 


998764 


o/i^/i 
2464 


r 


621446 


4059 


B 


235950 




5654 


B 


998505 


0/1 £C 

2465 


r 


619529 


4060 


B 


234510 


5655 


B 


1000417 


2466 


F 


622195 


4061 


B 


236399 


5656 


B 


999445 


2467 


r 


620292 


4062 


B 


235094 


5657 


B 


1001345 


2468 


F 


623244 


4063 


B 


236993 


5658 


B 


1001253 


2469 


F 


621446 


4064 


B 


236552 


5659 


B 


1003169 


24 /U 


r 


623834 


4065 


B 


238440 


5660 


B 


1001751 


0/171 
24/1 


r 


621954 


4066 


B 


238440 


5661 


B 


1003637 


24/2 


F 


625024 


4067 


B 


240381 


5662 


B 


1001954 


2473 


F 


623208 


4068 


B 


238989 


5663 


B 


1003786 


2474 


F 


626291 


4069 


B 


240917 


5664 


B 


1004003 


2475 


F 


624369 


4070 


B 


240294 


5665 


B 


1005872 


24 /O 


T? 

r 




4071 


B 


242181 


5666 


B 


1005114 


0/177 
Z4/ / 


T7 

r 


624821 




4072 


B 


242260 


5667 


B 


1006999 




T? 

r 


627597 




4073 


B 


244157 




5668 


B 


1005620 




r 


625703 




4074 


B 


243066 




5669 


B 


1007561 


24 oU 


F 


627927 




4075 


B 


245029 




5670 


B 


1007761 


O/l 0 1 

2481 


F 


626025 




4076 


B 


244703 




5671 


B 


1009618 


O/l oo 

2482 


F 


628712 




4077 


B 


246603 




5672 


B 


1008052 


o/i ca 
2483 


F 


626787 




4078 


B 


246151 




5673 


B 


1009941 


2484 


r 


fv?QCoc 




•a u / y 


a 


a a f\ t *r 
^48017 




5674 


B 


1008954 


2485 


F ( 


627685 




4080 


B 


247104 




5675 


B 


1010818 


2486 


F I 


530055 




4081 


B 


248997 




5676 


B 


1009679 


2487 


F 1 


528178 




4082 


B 


248001 




5677 


B 


1011621 
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SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2488 


F 


630672 




4083 


B 


249872 




5678 


B 


1012274 


2489 


F 


628789 




4084 


B 


248835 




5679 


B 


1014193 


2490 


F 


637125 




4085 


B 


250713 




5680 


B 


1012682 


2491 


F 


635224 




4086 


B 


249697 




5681 


B 


1014550 


2492 


F 


637807 




4087 


B 


251574 




5682 


B 


1012855 


2493 


F 


635913 




4088 


B 


250643 




5683 


B 


1014755 


2494 


F 


639960 




4089 


B 


252560 




5684 


B 


1013104 


2495 


F 


638102 




4090 


B 


251439 




5685 


B 


1014996 


2496 


F 


639960 




4091 


B 


253306 


5686 


B 


1013698 


2497 


F 


638012 




4092 


B 


252401 




5687 


B 


1015571 


2498 


F 


643252 




4093 


B 


254231 




5688 


B 


1014289 


2499 


F 


641355 


4094 


B 


253312 


5689 


B 


1016181 


2500 


F 


644146 


4095 


B 


255224 


5690 


B 


1014730 


2501 


F 


642204 


4096 


B 


256969 


5691 


B 


1016653 


2502 


F 


645175 


4097 


B 


258840 


5692 


B 


1015459 


2503 


F 


643275 


4098 


B 


258065 


5693 


B 


1017377 


2504 


F 


645519 


4099 


B 


259964 


5694 


B 


1016272 


2505 


F 


643652 


4100 


B 


258945 


5695 


B 


1018122 


2506 


F 


646869 


4101 


B 


260826 


5696 


B 


1017377 


2507 


F 


644983 


4102 


B 


259428 


5697 


B 


1019285 


2508 


F 


649714 


4103 


B 


261313 


5698 


B 


1018043 


2509 


F 


647800 


4104 


B 


261845 


5699 


B 


1019888 


2510 


F 


650199 


4105 


B 


263746 




5700 


B 


1019146 


2511 


F 


648260 




4106 


B 


264350 




5701 


B 


1021064 


2512 


F 


651421 




4107 


B 


266263 




5702 


B 


1019421 


2513 


F 


649536 




4108 


B 


264963 




5703 


B 


1021323 


2514 


F 


652285 




4109 


B 


266887 




5704 


B 


1020440 


2515 


F 


650397 




4110 


B 


265848 




5705 


B 


1022319 


2516 


F 


652562 




4111 


B 


267749 




5706 


B 


1021269 


2517 


F < 


550653 




4112 


B 


266644 




5707 


B 


1023161 




r ( 


D59653 




4113 


B 


268546 




5708 


B 


1021789 


2519 


F ( 


557741 




4114 


B 


268188 




5709 


B 


1023722 


2520 


F ( 


561449 




4115 


B 


270091 




5710 


B 


1022638 


2521 


F < 


559621 




4116 


B 


268188 




5711 


B 


L024535 



WO 99/28475 
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CI?/"* IX* 


Or. 


position 




SEQ ID 


Or. 


position 


SEQ ID 


Or. 


position 


9^99 

zozz 


P 

r 


662055 




4117 


B 


270091 


5712 


B 


1023900 


9^91 
ZjZj 


p 


66U167 




4118 


B 


268546 


5713 


B 


1025787 


ZjZ4 


P 

r 


662726 




4119 


B 


270440 


5714 


B 


1024169 


9<9^ 
ZDZj 


p 
r 


660831 




4120 


B 


268785 


5715 


B 


1026083 


9<9£ 
Z3Z0 


T? 

r 


663526 




4121 


B 


270697 


5716 


B 


1024996 


ZjZ / 


r 


if ni 

661591 




4122 


B 


270439 


5717 


B 


1026938 


Z3Z5 


P 

r 


664035 




4123 


B 


272288 


5718 


B 


1025295 


ZJZ7 


r 


06Z 1 34 




4124 


B 


271181 


5719 


B 


1027165 


Zj 


p 


o6j!)U4 




4125 


B 


273043 


5720 


B 


1026136 


Zj_5 1 


P 

r 


663591 




4126 


B 


271833 


5721 


B 


1028031 


9^79 


T7 

r 


665530 




4127 


B 


273738 


5722 


B 


1026823 


9^TJ 
Z3JJ 


T7 

r 


663663 


4128 


B 


273188 


5723 


E 


1028699 


9^'5vd 
ZD j4 


T7 

r 


666606 


4129 


B 


275062 


5724 


B 


1027642 


9^7^ 
ZjOj 


P 

r 


664/05 


4130 


B 


273817 


5725 


B 


1029524 




p 
r 


oo /yoi 


4131 


B 


275718 


5726 


B 


1030824 


9^7 


p 
r 


00j9o/ 


4132 


B 


275062 


5727 


B 


1032737 


9f*R 
ZJJS 


P 

r 


66/953 


4133 


B 


277005 


5728 


B 


1033510 


9^70 

zjjy 


r 


666037 


4134 


B 


275886 


5729 


B 


1035372 


9^n 

ZD4U 


T? 

r 


668397 


4135 


B 


277857 


5730 


B 


1032306 


Z.341 


17 

r 


666496 


4136 


B 


277923 


5731 


B 


1034209 


9^49 
ZJfZ 


p 
r 


</CAT5 C 

669ZJ5 


4137 


B 


279868 


5732 


B 


1037275 


9^n 


r 


uo/366 




4138 


B 


279043 


5733 


B 


1162 




p 
r 






4139 


B 


281004 


5734 


B 


1036130 


9^4^ 


p 
r 


667879 




4140 


B 


280126 


5735 


B 


1038037 


9^4£ 
Z 0 


r 


670513 




4141 


B 


282004 


5736 


B 


1036727 


9^47 


p 
r 


668612 




4142 


B 


281004 


5737 


B 


149 


9^48 


p 
r 


□70963 




4143 


B 


282911 


5738 


B 


225 


9^40 


p 
r 


□69058 




4144 


B 


281825 


5739 


B 


2104 


ZJ JU 


p 
r 


□72473 




4145 


B 


283746 


5740 


B 


17209 


9^1 
ZJ j 1 


p j 


570573 




4146 


B 


282335 


5741 


B ' 


19109 


2552 




^79052^ 
J/ZyOJ 




A 1 /i *7 
-414 / 


B 


284236 


5742 


B ; 


32032 


2553 


F ( 


571072 




4148 


B , 


284583 


5743 


b : 


J3899 


2554 


F < 


574381 




4149 


B , 


286455 


5744 


B I 


57057 


2555 


F ( 


572493 




4150 


b : 


285055 


5745 


B ! 


58954 
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cun in 
1U 


ur. 


position 




SEQ ID 


Or, 


position 




SEQ ID 


Or. 


position 


9^A 


P 

r 


<7^AOQ 

o / juzy 




4151 


B 


286921 




5746 


B 


57141 


9^7 

ZJ J 1 


p 
r 


<71 1 




A T C O 

41bZ 


B 


285894 




5747 


B 


59033 


Zj jo 


p 
r 


0/OZj4 




4153 


B 


287794 




5748 


B 


65406 


9^0 


r 


X7/f1 C7 




VI T C A 

4154 


B 


286441 




5749 


B 


67210 


ZjOU 


r 


Kin a oi 
0/ /4ol 




A T C C 

4155 


B 


288315 




5750 


B 


73871 


ZJO 1 


P 

r 


o/jjjy 




vl 1 C 

4156 


B 


286921 




5751 


B 


75741 


ZjOZ 


r 


0/8423 




4157 


B 


288811 




5752 


B 


78956 


ZjOj 


p 


o/ojoU 




A 1 C Q 

4158 


B 


288145 




5753 


B 


80903 


ZJO*f 


p 
r 


o /yu4y 




4159 


B 


290018 




5754 


B 


80042 


ZJOJ 


p 


O/ / 1 JO 




4160 


B 


289448 




5755 


B 


81944 


ZjOO 


p 


OoUjoU 


4161 


B 


291407 


5756 


B 


82642 


9^£7 
ZjO / 


p 


0/o4z3 


4162 


B 


290899 


5757 


B 


84491 


ZjOo 


p 


Oolzj / 


4 16o 


B 


292805 


5758 


B 


87820 


ZJO7 


p 
r 


o /y4zu 


4 164 


B 


291814 


5759 


B 


89658 




p 
r 


oozzoi 


4165 


B 


293655 


5760 


B 


110184 


9^71 

Z J / 1 


r 


0oU4jj 


4166 


B 


292489 


5761 


B 


112086 


9^79 
ZJ / Z 


r 


OoZo /U 


4167 


B 


294373 


5762 


B 


111873 


9^7^ 

Zj / J 


p 


OolUlz 


4168 


B 


293317 


5763 


B 


113837 


Zj /*f 


p 
r 


054147 


4169 


B 


295183 


5764 


B 


112302 


9^7^ 
ZJ / J 


p 
r 


oozzol 


4170 


B 


295073 


5765 


B 


114206 


Zj /o 


p 
r 


oo4joz 


4171 


B 


297037 


5766 


B 


113165 


9S77 
Z J / / 


p 


00ZO04 


4172 


B 


295932 




5767 


B 


115093 


9S7R 
Z J / o 


p 


oojy /o 




4173 


B 


297846 




5768 


B 


114270 


9^7Q 
ZJ /J* 


p 


/co/iAin 
Oo4UJj 




4174 


B 


296556 




5769 


B 


116158 


ZJOU 


p 


DO / lzl 




4175 


B 


298475 




5770 


B 


121039 


ZJO 1 


p 
r 


DO J loO 




4176 


B 


297474 




5771 


B 


122904 


9^R9 
ZJOZ 


p 


oo/y /4 




4177 


B 


299413 




5772 


B 


125742 


95Jtt 


p 


D00U44 




4178 


B 


298970 




5773 


B 


127643 


XJO*T 


r ( 


DOOloy 




4179 


B 


300855 




5774 


B 


132170 


xjoj 


r ( 


Ejooj Ij 




4180 


B 


300679 




5775 


B 


134028 


ZJoO 


p i 

r i 






4181 


B 


302595 




5776 


B 


144647 


2587 


F < 


S87511 




4182 


B . 


302372 




5777 


B 


146547 


2588 


F < 


389580 




4183 


B ; 


304272 




5778 


B 


150960 


2589 


F < 


387727 




4184 


b : 


305137 




5779 


B 


L52837 



WO 99/28475 
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SEQID 


Or, 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


2590 


F 


690204 




4185 


B 


307039 




5780 


B 


164761 


2591 


F 


688342 




4186 


B 


306377 




5781 


B 


166686 


2592 


F 


690431 




4187 


B 


308287 




5782 


B 


166362 


2593 


F 


688497 




418B 


B 


306730 




5783 


B 


168305 


If Hit 

2594 


F 


691790 




4189 


B 


308614 




5784 


B 


168970 


2595 


F 


689919 




4190 


B 


307199 




5785 


B 


170889 


r\ c r\ s~ 

2596 


F 


693614 




4191 


B 


309120 




5786 


B 


171056 


2597 


F 


691704 




4192 


B 


309018 




5787 


B 


173021 


2598 


F 


694723 




4193 


B 


310903 




5788 


B 


177747 


2599 


F 


692821 


4194 


B 


310128 


5789 


B 


179629 


2600 


F 


696922 


4195 


B 


312001 


5790 


B 


188605 


2601 


F 


695033 


4196 


B 


310966 


5791 


B 


190552 


2602 


F 


697714 


4197 


B 


312899 


5792 


B 


189016 


2603 


F 


695816 


4198 


B 


311790 


5793 


B 


190924 


2604 


F 


698510 


4199 


B 


313705 


5794 


B 


190871 


2605 


F 


696612 


4200 


B 


312671 


5795 


B 


192749 


2606 


F 


700037 


4201 


B 


314590 


5796 


B 


197533 


2607 


F 


698119 


4202 


B 


314590 


5797 


B 


199449 


2608 


F 


700691 


4203 


B 


316484 


5798 


B 


211604 


2609 


F 


698783 


4204 


B 


314977 


5799 


B 


213554 


2610 


F 


701885 


4205 


B 


316880 


5800 


B 


235455 


2611 


F 


699984 


4206 


B 


315775 


5801 


B 


237385 


2612 


F 


703303 




4207 


B 


317646 


5802 


B 


237448 


2613 


F 


701403 




4208 


B 


316760 


5803 


B 


239387 


2614 


F 


704791 




4209 


B 


318627 




5804 


B 


250266 


26I5 


F 


702877 




4210 


B 


317541 




5805 


B 


252155 


2616 


F 


705452 




4211 


B 


319422 




5806 


B 


253731 


2617 


F 


703584 




4212 


B 


317829 




5807 


B 


255663 


2618 


F 


705918 




4213 


B 


319763 




5808 


B 


255115 


2619 


F 


704019 




4214 


B 


318703 




5809 


B 


256969 


zozu 


r 


706241 




4215 


B 


320628 




5810 


B 


272158 


2621 


F 


704322 




4216 


B 


318094 




5811 


B 


274093 


2622 


F 


707833 




4217 


B 


320048 




5812 


B 


276317 


2623 


F 


705939 




4218 


B 


319182 




5813 


B 


278190 



WO 99/28475 
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Or. 


position 




SEQ ID 


Or. 


position 


SEQ ID 


Or. 


position 


2624 


F 


708029 




4219 


B 


321067 


5814 


B 


278470 


2625 


F 


706086 




4220 


B 


320404 


5815 


B 


280366 


2626 


F 


708653 




4221 


B 


322278 


5816 


B 


283005 


2627 


F 


706753 




4222 


B 


321720 


5817 


B 


284873 


TiC7 0 

262 8 


F 


710042 




4223 


B 


323625 


5818 


B 


293718 


2629 


F 


708142 




4224 


B 


322158 


5819 


B 


295643 


Z03U 


r? 

r 


71H85 




4225 


B 


324071 


5820 


B 


303690 


203 1 


F 


709291 




4226 


B 


322582 


5821 


B 


305624 


2632 


F 


712521 




4227 


B 


324500 


5822 


B 


309538 


2633 


T? 

F 


710641 




4228 


B 


323371 


5823 


B 


311476 


2634 


F 


713432 




4229 


B 


325260 


5824 


B 


312791 


2ojD 


F 


711506 




4230 


B 


325173 


5825 


B 


314685 


2636 


F 


713901 


4231 


B 


327057 


5826 


B 


313073 


203 / 


T? 

r 


711987 


4232 


B 


325882 


5827 


B 


314977 


Zo3o 


F 


n A e en 

714557 


4233 


B 


327770 


5828 


B 


313506 


263V 


F 


712708 


4234 


B 


326509 


5829 


B 


315343 


2640 


F 


715339 


4235 


B 


328388 


5830 


B 


320823 


2641 


F 


713437 


4236 


B 


327463 


5831 


B 


322730 


2642 


F 


715702 


4237 


B 


329330 


5832 


B 


340723 


Z04J 


t 


713761 


4238 


B 


328374 


5833 


B 


342638 


ZD44 


F 


716892 


4239 


B 


330270 


5834 


B 


353562 


Z04D 


T? 

r 


714970 


4240 


B 


328850 


5835 


B 


355444 


Z040 


r? 
r 


718240 




4241 


B 


330751 


5836 


B 


373944 


204/ 


F 


716381 




4242 


B 


329330 


5837 


B 


375838 


204o 


F 


718240 




4243 


B 


331210 


5838 


B 


377997 


Z04V 


r 


716380 




4244 


B 


329883 


5839 


B 


379877 




F 


719563 




4245 


B 


331822 


5840 


B 


379877 




F 


717658 




4246 


B 


330886 


5841 


B 


381778 


20D2 


F 


719916 




4247 


B . 


332797 


5842 


B 


395318 


2653 


F ' 


718025 




4248 


B : 


331395 


5843 


B . 


397228 




IT 

r 


/20346 




4249 


b : 


333375 


5844 


B - 


101846 


2655 


F ; 


718429 




4250 


b ; 


331990 


5845 


B i 


103782 


2656 


F ' 


721306 




4251 


B : 


333884 


5846 


B i 


110759 


2657 


F ' 


719440 




4252 


b : 


332669 


5847 


B i 


1126.77 
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RFO in 


fir 

ur. 


posiuon 






ur . 


position 




SEQ ID 


Or, 


position 


96SR 


p 


799178 




ftZDo 


B 


334575 




5848 


B 


411878 


2659 


F 
i 


790971 




4 OCA 
fx Z Oft 


n 


nine 




5849 


B 


413779 


2660 


17 


79T1 ^0 






n 
0 


loci /; 




5850 


B 


415199 


2661 


V 


791 9^0 








JJJOll 




5851 


B 


417099 


2662 




794^7 




A 9 c.7 


n 
0 


7 "3 C *7 n O 




5852 


B 


423479 




r 


799 4^ 1 
/ZZ*f J 1 




A 7 £ Q 


B 


"5 *3 A no 

0 J4 579 




5853 


B 


425332 


2664 


F 


79^401 




4Zoy 


r» 


6497 




5854 


B 


428421 


966S 


F 


797AA7 




4ZO U 


B 


334590 




5855 


B 


430332 


2666 


F 


79^19 




1ZDJ. 


B 


0 0 o4 y / 




5856 


B 


429678 


2667 


F 


79441 7 




A7£7 
ft- Z O Z 


B 


"3 *3 C *3 ^ O 
O J D J OZ 


5857 


B 


431571 


2668 


F 


796S9A 




B 


J j 7262 


5858 


B 


443036 


2669 


F 


794^00 


ft Z Oft 


B 


■)*) r- 0 £ «5 


5859 


B 


444947 


2670 


F 
r 


79794^ 
/Z /ZHD 


4Z O 3 


B 


J3772 J 


5860 


B 


444280 


2671 


F 
r 


79^9^ 


4Zoo 


B 


335939 


5861 


B 


446161 


2672 


F 


79 R081 


/OCT 

4Zo / 




337848 


5862 


B 


443964 


2673 


F 


796900 


ft z 0 0 


ta 


*2 "3 ^ C C O 


5863 


B 


445811 


2674 


F 


798^10 


ft z 0 y 




J joftui 


5864 


B 


446392 


2675 


F 


796618 


4970 
ft z / u 


D 

0 




5865 


B 


448276 


2676 


F 


790914 

/Z7Z l*f 


4 7 "7 1 
4Z / 1 


B 


339503 


5866 


B 


468498 




F 


777*3 i o 

/z / j i y 


4z /z 


B 


340181 


5867 


B 


470382 


967R 


r 




/l *5 *7 O 
4 2 /3 


B 


342110 


5868 


B 


472328 


2679 


F 




4Z /4 


B 


342069 


5869 


B 


474285 


2680 


F 
r 




f±Z /D 


B 


343977 


5870 


B 


488594 


2681 


F 


73961 8 


497£ 

fx £, / O 


B 


^ A TOCO 

34ZZ©y 




5871 


B 


490459 


2682 


F 




4 977 
fx Z / / 


B 


1 A A T CO 

344 loo 




5872 


B 


497914 


2683 


F 

X 


7^599 




A O 7Q 
fx Z / O 


B 


34Z694 




5873 


B 


499837 


2684 


F 


71699A 




4 z / y 


B 


344593 




5874 


B 


500718 


268S 


r 


7^4155 




4Z oU 


B 


343594 




5875 


B 


502596 


2686 


F 


71/^QZ.Q 

/joyoy 




4 Zo 1 


B 


345503 




5876 


B 


509811 


2687 


F 
r 


71<nQ7 
/ j jUVZ 




/l 0 0 0 

4ZBZ 


B 


344269 




5877 


B 


511702 


2688 


F 


737678 




4283 


B 


346199 




jO / O 


T5 
D 


CI •} A O C 


2689 


F 


735820 




4284 


B 


344452 




5879 


B 


513385 


2690 


F 


738618 




4285 


B 


346382 




5880 


B 


527090 


2691 


F 


736719 




4286 


B . 


345362 




5881 


B , 


529014 
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omL\2 ID 


Ur. 


position 




SEQ ID 


Or . 


position 




SEQ ID 


Or. 


position 


7£07 

ZWZ 


■c 
r 


739841 




4287 


B 


347262 




5882 


B 


532083 


KOI 


T? 

r 


737941 




4288 


B 


346199 




5883 


B 


533999 


7<CA/1 

2o94 


T? 

F 


741797 




4289 


B 


348069 




5884 


B 


557487 


2695 


F 


739895 




4290 


B 


347326 


5885 


B 


559357 


-Zoyo 


17 

r 


742583 




4291 


B 


349228 




5886 


B 


565191 


7£Q7 


r 


740705 




4292 


B 


348165 




5887 


B 


567099 


7AOB 

zoyo 


T? 

r 


743087 




4293 


B 


350060 




5888 


B 


567452 




r 


■y ji 1 1 ^ft 

741 169 




4294 


B 


350399 




5889 


B 


569355 


27UU 


r 


74421 1 




4295 


B 


352288 




5890 


B 


571110 


77a 1 
2/U1 


r 


742277 


4296 


B 


351503 


5891 


B 


573047 


77A1 


r 


1 /I /I O ft 1 

744391 


4297 


B 


353403 


5892 


B 


571557 


27UJ 


T? 

r 


7424P2 


4298 


B 


352460 


5893 


B 


573461 


77AH 

z/04 


F 


744739 


4299 


B 


354356 


5894 


B 


576288 


77AC 


r 


742813 


4300 


B 


352948 


5895 


B 


578181 


77A£ 
z/Uo 


r 


TjI CI >I1 

745342 


4301 


B 


354901 


5896 


B 


590890 


z /U / 


r 


743438 


4302 


B 


353959 


5897 


B 


592770 


1*7 A© 

z/Uo 


F 


746197 


4303 


B 


355890 


5898 


B 


598813 


7 7 AO 

z/Uy 


r 


744273 


4304 


B 


354438 


5899 


B 


600714 


771 A 
Z/IU 


r 


746725 


4305 


B 


356378 


5900 


B 


607146 


771 1 
Z / 1 1 


r 


744784 


4306 


B 


354997 


5901 


B 


609012 


771 7 

Z / lz 


r 


748041 


4307 


B 


356866 


5902 


B 


608260 


771 1 
Z / 1 J 


TO 

r 


*T vl ^ 1 A*\ 

746 l 42 


43 08 


B 


356897 


5903 


B 


610162 


77 1 vl 
Z/ 14 


T? 

r 


*7 vt OT)1 

748723 


4309 


B 


358793 


5904 


B 


610621 


071 ^ 

Z / 1 J 


tr 
r 


746795 


4310 


B 


357643 


5905 


B 


612514 


771 *C 

z/ lo 




*T vt ft 1 *T 1 

749171 


4311 


B 


359499 


5906 


B 


633573 


771 7 

Zl I / 


r 


*Tvl "7*1 1 A 

747310 




43 12 


B 


358323 


5907 


B 


635473 


771 0 

z/lo 


r 


*7 Af\A "IE 

749475 




4313 


B 


360222 


5908 


B 


637702 


7*7 1 A 

z/iy 


F 


747568 




4314 


B 


360972 




5909 


B 


639603 


2720 


F 


749761 




4315 


B 


362863 




5910 


B 


650757 


2721 


F 


747855 




4316 


B 


361348 




5911 


B 


652667 


7.170 


r 


7^77524 
/ jZZon 




A 7 T 7 


o 
a 


o 6 J2o3 




5912 


B 


652808 


2723 


F 


750384 




4318 


B 


362109 




5913 


B 


654682 


2724 


F 


753397 




4319 


B 


364008 




5914 


B 


655545 


2725 


F 


751497 




4320 


B 


362983 




5915 


B 


657446 



WO 99/28475 



SEQID 


Or. 


position 


2726 


F 


754693 


2727 


F 


752818 


2728 


F 


756537 


2729 


F 


754648 


2730 


F 


758227 


2731 


F 


756276 


2732 


F 


759119 


2733 


F 


757196 


2734 


F 


759639 


2735 


F 


757745 


2736 


F 


759957 


2737 


F 


758069 


2738 


F 


760675 


2739 


F 


758798 


2740 


F 


761489 


2741 


F 


759589 


2742 


F 


762033 


2743 


F 


760133 


2744 


F 


763116 


2745 


F 


761215 


2746 


F 


764209 


2747 


F 


762315 


2748 


F 


764602 


2749 


F 


762702 


2750 


F 


765834 


2751 


F 


763904 


2752 


F 


766671 


2753 


F 


764806 


2754 


F 


768033 


2755 


F 


766063 


2756 


F 


768572 


2757 


F 


766671 


2758 


F 


769873 


2759 


F ' 


768006 



273 



SEQ ID 


Or. 


position 


4321 


B 


364867 


4322 


B 


364110 


4323 


B 


366002 


4324 


B 


365415 


4325 


B 


367338 


4326 


B 


365807 


4327 


B 


367733 


4328 


B 


367607 


4329 


B 


369440 


4330 


B 


368881 


4331 


B 


370788 


4332 


B 


369317 


4333 


B 


371209 


4334 


B 


370522 


4335 


B 


372440 


4336 


B 


371311 


4337 


B 


373206 


4338 


B 


373097 


4339 


B 


374941 


4340 


B 


373753 


4341 


B 


375649 


4342 


B 


374424 


4343 


B 


376324 


4344 


B 


374956 


4345 


B 


376888 


4346 


B 


376611 


4347 


B 


378511 


4348 


B 


377297 


4349 


B 


379209 


4350 


B 


378960 


4351 


B 


380880 


4352 


B 


379309 


4353 


B 


381180 


4354 


B 


379667 



PCT/IB98/01939 



SEQ ID 


Or. 


p sition 


5916 


B 


661392 


5917 


B 


663292 


5918 


B 


677837 


5919 


B 


679716 


5920 


B 


679748 


5921 


B 


681674 


5922 


B 


732909 


5923 


B 


734756 


5924 


B 


742639 


5925 


B 


744503 


5926 


B 


759613 


5927 


B 


761510 


5928 


B 


760782 


5929 


B 


762671 


5930 


B 


771617 


5931 


B 


773519 


5932 


B 


772628 


5933 


B 


774528 


5934 


B 


788703 


5935 


B 


790577 


5936 


B 


816591 


5937 


B 


818443 


5938 


B 


847145 


5939 


B 


849042 


5940 


B 


868276 


5941 


B 


870177 


5942 


B 


875887 


5943 


B 


877779 


5944 


B 


877137 


5945 


B 


879035 


5946 


B 


884780 


5947 


B 


886680 


5948 


B 


892172 


5949 


B 


894073 



WO 99/28475 
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PCT7IB98/01939 



err* xt\ 


Or. 


position 




SEQ ID 


Or. 


position 




SEQ ID 


Or. 


position 


z/oo 


r 


769966 




4355 


B 


381553 




5950 


B 


900990 


77A1 
Z /01 


r 


/ooOoO 




A 1 C E 

4356 


B 


380238 




5951 


B 


902955 


77A7 
Z/OZ 


c 
r 


77041 1 




4357 


B 


382152 




5952 


B 


902780 


77A7 
Z /OJ 


T? 

r 


ICQ A CC 

768455 




yi O CO 

4358 


B 


381699 




5953 


B 


904687 


Z/04 


r 


771 103 




4359 


B 


383615 




5954 


B 


908266 


Z/OJ 


r 


769211 




4360 


B 


382790 




5955 


B 


910218 


77££ 
Z/OO 


T? 

r 


771980 




4361 


B 


384687 




5956 


B 


912811 


77/£7 
Z/0/ 


•p 
r 


77A 1 1 a 

7701 16 




4362 


B 


383935 




5957 


B 


914730 


Z /Do 


T? 

r 


771 1 H£L 
i /J176 




4363 


B 


385837 




5958 


B 


935988 


z/oy 


r 


/71305 


A *i £ A 

4364 


B 


384167 


5959 


B 


937863 


777fi 
Z / /U 


T7 

r 


773937 


4365 


B 


386065 


5960 


B 


947227 


077 1 
Z / / 1 


r 


771980 


4366 


B 


385479 


5961 


B 


949089 


777T 

2.1 12. 


r 


776399 


4367 


B 


387365 


5962 


B 


953426 


7771 


r 


T7 /I C 1 A 

774514 


4368 


B 


385730 


5963 


B 


955397 


inn a 


r 


77££77 

776672 


4369 


B 


387635 


5964 


B 


966421 


777^ 
21 lo 


T? 

r 


7*7/1 771 


4.3 /U 


B 


387115 


5965 


B 


968345 


777A 


tr 
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1. An isolated polynucleotide having a nucleotide sequence of a Chlamydia 
trachomatis genome, comprising 

(a) the nucleotide sequence of SEQ ID No. 1; 

(b) the nucleotide sequence contained within the Chlamydia trachomatis genomic 
DNA in ECACC Deposit No. 98112618; 

(c) the nucleotide sequence contained in a clone insert in ECACC Deposit No. 
98112617; 

(d) a nucleotide sequence exhibiting at least 99.9% identity with the sequence of 
SEQ ID No. l;or 

(e) a nucleotide sequence exhibiting at least 80% homology to SEQ ED No. 1. 

2. An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the 
Chlamydia trachomatis genomic DNA contained in ECACC Deposit No. 98112618 or to a clone 
insert in ECACC Deposit No. 98 1 12617 under conditions of high stringency . 

3. An isolated polynucleotide which hybridizes to SEQ ID No. 1 or to the 
Chlamydia trachomatis genomic DNA contained in ECACC Deposit No. 98112618 
under conditions of intermediate stringency. 

4. An isolated polynucleotide having a nucleotide sequence of an open reading 
frame (ORF) of a Chlamydia trachomatis genome, comprising: 

(a) a nucleotide sequence chosen from one of ORF2 to ORF 1 197; 

(b) a nucleotide sequence exhibiting at least 99.9% identity with one of ORF2 to 
ORF 1197; or 

(c) a nucleotide sequence exhibiting at least 80% homology to one of ORF2 to 
ORF 1197. 



5. An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1197 
under conditions of high stringency. 

6. An isolated polynucleotide which hybridizes to one of ORF2 to ORF 1197 
under conditions of intermediate stringency. 

7. The polynucleotide of Claim 2, 3, 4, 5, or 6 which encodes the following 
polypeptides or fragments thereof: 
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(a) a Chlamydia trachomatis transmembrane polypeptide having between 1 
and 3 transmembrane domains; 

(b) a Chlamydia trachomatis transmembrane polypeptide having between 4 and 6 
transmembrane domains; 

(c) a Chlamydia trachomatis transmembrane polypeptide having at least 7 
transmembrane domains; 

(d) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
sugars and/or cofactors; 

(e) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
nucleotides or nucleic acids; 

(f) a Chlamydia trachomatis polypeptide involved in metabolism of amino acids 
or polypeptides; 

(g) a Chlamydia trachomatis polypeptide having involved in metabolism of fatty 
acids; 

(h) a Chlamydia trachomatis polypeptide involved in the synthesis of the cell 
wall; 

(i) a Chlamydia trachomatis polypeptide involved in transcription, translation, 
and/or maturation process; 

(j) a Chlamydia trachomatis transport polypeptide; 
(k) a Chlamydia trachomatis polypeptide involved in the virulence process; 
(1) a Chlamydia trachomatis polypeptide involved in the secretory system and/or 
which is secreted; 

(m) a Chlamydia trachomatis polypeptide of the cellular envelope or outer cellular 

envelope of Chlamydia trachomatis. 
(n) a Chlamydia trachomatis surface exposed polypeptide; 
(o) a Chlamydia trachomatis lipoprotein; 

(p) a Chlamydia trachomatis polypeptide involved in lipopolysaccharide 
biosynthesis; 

(q) a Chlamydia trachomatis KDO-related polypeptide; 
(r) a Chlamydia trachomatis phosphomannomutase-related polypeptide; 
(s) a Chlamydia trachomatis phosphoglucomutase-related polypeptide; 
(t) a Chlamydia trachomatis lipid A component-related polypeptide; 
(u) a Chlamydia trachomatis polypeptide that contains an RGD sequence; 
(v) a Chlamydia trachomatis Type III secreted polypeptide; 
(w) a Chlamydia trachomatis cell wall anchored surface polypeptide; or 
(x) a Chlamydia trachomatis polypeptide that is not found in Chlamydia 
trachomatis. 
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8. A polynucleotide encoding a fusion protein, comprising one of ORF2 to ORF 
1 197 of Claim 4, 5, or 6 ligated in frame to a polynucleotide encoding a heterologous polypeptide. 

9. A recombinant vector that contains the polynucleotide of Claim 1, 2, 3, 4, 5 or 

6. 

1 0. A recombinant vector that contains the polynucleotide of Claim 8. 

11. A recombinant vector that contains the polynucleotide of Claim 4, 5 or 6, 
operatively associated with a regulatory sequence that controls gene expression. 

12. A recombinant vector that contains the polynucleotide of Claim 8 operatively 
associated with a regulatory sequence that controls gene expression. 

13. A genetically engineered host cell that contains the polynucleotide of Claim I, 

2, 3, 4, 5 or 6. 

14. A genetically engineered host cell that contains the polynucleotide of Claim 8. 

15. A genetically engineered host cell that contains the polynucleotide of Claim 4, 
5 or 6 operatively associated with a regulatory sequence that controls gene expression in the host cell. 

16. A genetically engineered host cell that contains the polynucleotide of Claim 8 
operatively associated with a regulatory sequence that controls gene expression in the host cell. 

17. A method for producing a polypeptide, comprising: 

(a) culturing the genetically engineered host cell of Claim 15 under conditions 
suitable to produce the polypeptide encoded by the polynucleotide; and 

(b) recovering the polypeptide from the culture. 

18. A method for producing a fusion protein, comprising: 

(a) culturing the genetically engineered host cell of Claim 16 under conditions 
suitable to produce the fusion protein encoded by the polynucleotide; and 

(b) recovering the fusion protein from the culture. 



19. 



A polypeptide encoded by the polynucleotide of Claim 4, 5 or 6. 
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20. The polypeptide of Claim 19 which immunoreacts with seropositive serum of 
an individual infected with Chlamydia trachomatis. 

21. The polypeptide of Claim 19 which comprises the following polypeptides or 
fragments thereof: 

(a) a Chlamydia trachomatis transmembrane polypeptide having between 1 and 3 
transmembrane domains; 

(b) a Chlamydia trachomatis transmembrane polypeptide having between 4 and 6 
transmembrane domains; 

(c) a Chlamydia trachomatis transmembrane polypeptide having at least 7 
transmembrane domains; 

(d) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
sugars and/or cofactors; 

(e) a Chlamydia trachomatis polypeptide involved in intermediate metabolism of 
nucleotides or nucleic acids; 

(f) a Chlamydia trachomatis polypeptide involved in metabolism of amino acids 
or polypeptides; 

(g) a Chlamydia trachomatis polypeptide involved in metabolism of fatty acids; 

(h) a Chlamydia trachomatis polypeptide involved in the synthesis of the cell 
wall; 

(i) a Chlamydia trachomatis polypeptide involved in transcription, translation, 
and/or maturation process; 

fl) a Chlamydia trachomatis transport polypeptide; 
(k) a Chlamydia trachomatis polypeptide involved in the virulence process; 
(1) a Chlamydia trachomatis polypeptide involved in the secretory system and/or 
which is secreted; 

(m) a Chlamydia trachomatis polypeptide of the cellular envelope or outer cellular 

envelope of Chlamydia trachomatis. 
(n) a Chlamydia trachomatis surface exposed polypeptide; 
(o) a Chlamydia trachomatis lipoprotein; 

(p) a Chlamydia trachomatis polypeptide involved in lipopolysaccharide 
biosynthesis; 

(q) a Chlamydia trachomatis KDO-related polypeptide; 
(r) a Chlamydia trachomatis phosphomannomutase-related polypeptide; 
(s) a Chlamydia trachomatis phosphoglucomutase-related polypeptide; 
(t) a Chlamydia trachomatis lipid A component-related polypeptide; 
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(u) a Chlamydia trachomatis polypeptide that contains an RGD sequence; 
(v) a Chlamydia trachomatis Type III secreted polypeptide; 
(w) a Chlamydia trachomatis cell wall anchored surface polypeptide; or 
(x) a Chlamydia trachomatis polypeptide that is not found in* Chlamydia 
trachomatis. 

22. A fusion protein encoded by the polynucleotide of Claim 8. 



23. The fusion protein of Claim 22 which inimunoreacts with seropositive serum 
of an individual infected with Chlamydia trachomatis. 



24. An antibody that immunospecifically binds to the polypeptide of Claim 19. 

25 . An antibody that immunospecifically binds to the fusion protein of Claim 22. 

26. A method for the detection and/or identification of Chlamydia trachomatis in 



(a) contacting the sample with a polynucleotide primer of Claim 1, 2, 3, 4, 5, or 6 
in the presence of a polymerase enzyme and nucleotides under conditions 
which permit primer extension; and 

(b) detecting the presence of primer extension products in the sample in which the 
detection of primer extension products indicates the presence of Chlamydia 
trachomatis in the sample. 

27. A method for the detection and/or identification of Chlamydia trachomatis in 



(a) contacting the sample with a polynucleotide probe of Claim 1, 2, 3, 4, 5, or 6 
under conditions which permit hybridization of complementary base pairs; and 

(b) detecting the presence of hybridization complexes in the sample in which the 
detection of hybridization complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



28. A method for the detection and/or identification of Chlamydia trachomatis in 
a biological sample, comprising: 

(a) contacting the sample with the antibody of Claim 24 under conditions suitable 
for the formation of immune complexes; and 



a biological sample, 



comprising: 



a biological sample, 



comprising: 
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(b) 



detecting the presence of immune complexes in the sample, in which the 
detection of immune complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



29. 



A method for the detection and/or identification of antibodies to Chlamydia 



trachomatis in a biological sample, comprising: 



(a) contacting the sample with a polypeptide of Claim 19 under conditions 
suitable for the formation of immune complexes; and 

(b) detecting the presence of immune complexes in the sample, in which the 
detection of immune complexes indicates the presence of Chlamydia 
trachomatis in the sample. 



30. A DNA chip containing an array of polynucleotides comprising at least one of 
the polynucleotides of Claim 1, 2, 3, 4, 5, or 6. 

31. A protein chip containing an array of polypeptides comprising at least one of 
the polypeptides of Claim 19. 

32. An immunogenic composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 

33. An immunogeneic composition comprising the polypeptide of Claim 20 and a 
pharmaceutically acceptable carrier. 

34. An immunogenic composition comprising the fusion protein of Claim 22 and 
a pharmaceutically acceptable carrier. 

35. An immunogenic composition comprising the fusion protein of Claim 23 and 
a pharmaceutically acceptable carrier. 

36. A pharmaceutical composition comprising the polypeptide of Claim 19 and a 
pharmaceutically acceptable carrier. 



37. A pharmaceutical composition comprising the polypeptide of Claim 20 and a 
pharmaceutically acceptable carrier. 
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38. A pharmaceutical composition comprising the polypeptide of 
Claim 22 and a pharmaceutical^ acceptable carrier. 

39. A pharmaceutical composition comprising the polypeptide of Claim 23 and a 
pharmaceutical^ acceptable carrier. 

40. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 32. 

41. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 33. 

42. A method of immunizing against Chlamydia trachomatis, comprising 
administering to a host an immunizing amount of the immunogenic composition of Claim 34. 

43. A method of immunizing against Chlamydia trachomatis, comprising: 
administering to a host an immunizing amount of the immunogenic composition of Claim 35. 

44. A DNA immunogenic composition comprising the expression vector of Claim 

11. 



45 . The DNA composition of Claim 44, wherein the DNA composition directs the 
expression of a neutralizing epitope of Chlamydia trachomatis, 

46. A DNA immunogenic composition comprising the expression vector of Claim 

12. 

47. The DNA composition of Claim 46, wherein the DNA composition directs the 
expression of a neutralizing epitope of Chlamydia trachomatis. 

48. A screening assay, comprising: 

(a) contacting a test compound with an isolated polynucleotide of Claim 1 , 2, 3, 4, 
5 or 6; and 

(b) detecting whether binding occurs. 



49. 
(a) 



A screening assay, comprising: 

contacting a test compound with the polypeptide of Claim 19; and 
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(b) 



detecting 



whether binding occurs. 



50. 



A screening assay, comprising: 

contacting a test compound with the polypeptide of Claim 22; and 
detecting whether binding occurs. 



(a) 
(b) 



51. A kit comprising a container containing an isolated polynucleotide of Claim 1 , 

2, 3, 4, 5 or 6. 



53. The kit of Claim 51 wherein the polynucleotide is a primer and the kit further 
comprises a container containing a polymerase. 

54. The kit of Claim 51 which further comprises a container containing 
deoxynucleotide triphosphates. 

55. A kit comprising a container containing an antibody that immunospecifically 
binds to the polypeptide of Claim 19. 

56. A kit comprising a container containing an antibody that immunospecifically 
binds to the fusion protein of Claim 22. 



52. 



The kit of Claim 5 1 wherein the polynucleotide is a primer or a probe. 



WO 99/28475 



1 / 3 



PCT/IB98/01939 




i 



I 

i 



c 

■*- 0 
•H 
C «D 
0 <Q 



2 3 O C 
CD co CT *w flJ 

Q w - + -h ^ 



(0 Cl] to 
JQ CO 
ro < 




o 

XI 




o o 

a> a 
cr w 

CO 'O 



cr 
o 
a 

4-1 jQ 

o 

a 

c <u 
o oj 

w a 
u e 

a 

0J Q) 







o 


u 






c 


£ 


o 




•H 


J-i 


u 


a 


rO 




U 


CD 


•H 


u 


U~l 


c 


•H 


cu 


-U 


3 


c 


cr 


o 




*D 










c 
o 

ij 

jj 
c 




4J aj 
0) £ ^ o 

J U H 
H C 4^ 

ro CO CJ 
~* C -H *0 

^ (0 A) E o 

5 4-1 iJ CO 

1 c u m < 

^ O ro -h 

^ on ao 



c 
o 



C 

CD 



WO 99/28475 



PCT/IB98/01939 



2 / 3 




o 
















o 














J2 




o 


Q 




< 


e 


H 




CO 


00 





OJ 




p 




>1 








X) 




e 




CD 




« 




CO 




CO 




M — 1 




0 








CD 


to 


U 






0 


4J 


•0 


U 


C 


rO 


•H 


M-J 


ft 






C 


U 


03 


04 


2 



x: 
o * 



4J O 

C **"* 
•H 03 

o a; 

c c 
a 5 

•H W 

<o 

c ±J 

o c 

•H O 
4J U 

m 5 



3 cd 

m C 

(0 <D 

CP 

co a> 

a> « 
c 



x: 
u 



C 
•H 

CO 

CO 5 
•H CO 
E t» 



C c 

u to to 

c co cn 

a E jj 

cr c 

di (V O 

CO J= u 

dj cx: 



^ (D 

o -c c 

4J 0 

U 
Cn ? 
C 

■H (D 
C XT 



C 

o 



Qj 03 

0) X} 

x: o 



<u 

CD 0) -C 

ec j= jj 
• J o 



4-) 

05 
C 

■H 03 

p iJ C 

H JQ CD 

CD o <d 

a o cu 



Q-i O 



c 

c 

a> 

CD 

-U 
CD 



Q) to 



o 
x; 
u 



to 

0) 
0 
c 

a; cd 
e D 
*h cr 

M 0) 

a co 



CD 


CO 


x: 






*H C 




x: <d 




CO 


o 


c c 




o m 


to 


•h x: 




jj a 


C 


03 V-t 


a> 


•H o 


CD ~ 


cd 


U 


jC 




jJ 


O CO 




C CP 


n j 


*H 


O 






C C 


c 


-H 0 


o 


> u 


•H 


03 J 




JZ 




u 


c 


5 CD 




to x: 


£ 


a* 4j 


u 


•H o 


CD 




4J 


c x: 


CD 


0 4-» 


Q 


U *H 


• 









to 






TJ 






C 






CD 


x: 






o 




c 


*H 




OJ 


x: 


CD XT 


3 


x: 


a 






u 


(0 




o 


u 


> 




CD 




tD 


E 


D 




*H 




XJ 


M 


CO 




a 




C 




0 


CD 


OS 


aJ 


CD 


o 








CD 


JJ 




— ( 


CD 


CD 


X) 


X) 


x: 


•H 






CO 


a 




CO 


•H 




o 


x: 


0 


a co 






c 


(D 




o 


U 


•H 


•H 


-H 




XJ 


O 


CD 


03 


x: 




rH 


u 


OJ 


CD 


• 


E 


U 





.22 


S 


As 


ola 




00 


O 


c 






GO 



"Y" 



m 

3 




WO 99/28475 



3 / 3 



PCT7IB98/01939 




SUBSTITUTE SHEET (RULE 26) 



CORRECTED 
VERSION* 



PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/31, 15/62, C07K 14/295, 16/12, 
19/00, A01K 67/027, A61K 39/118, 
G01N 33/53, C12Q 1/68 



A3 



(11) International Publication Number: 
(43) International Publication Date: 



WO 99/28475 

10 June 1999(10.06.99) 



(21) International Application Number: PCT/IB98/01939 

(22) International Filing Date: 27 November 1998 (27.1 1.98) 



(30) Priority Data: 
97/15041 
97/16034 
60/107,077 



28 November 1 997 (28. 1 1 .97) FR 
17 December 1997 (17.12.97) FR 
4 November 1 998 (04. 1 1 .98) US 



(71) Applicant (for alt designated States except US): GENSET 

[FR/FR]; 24, rue Royale, F-75008 Paris (FR). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): GRIFFAIS, Remy [FR/FR]; 
51, boulevard Remain Roland, F-92120 Montrouge (FR). 

(74) Agents: MARTIN, Jean-Jacques et al.; Cabinet Regimbeau, 
26, avenue K16ber, F-751 16 Paris (FR). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, 
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE. DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the claims 
and to be republished in the event of the receipt of amendments. 

(88) Date of publication of the international search report: 

18 November 1999(18.11.99) 



(54) Title: CHLAMYDIA TRACHOMATIS GENOMIC SEQUENCE AND POLYPEPTIDES, FRAGMENTS THEREOF AND USES 
THEREOF, IN PARTICULAR FOR THE DIAGNOSIS, PREVENTION AND TREATMENT OF INFECTION 



(57) Abstract 

The subject of the invention is the genomic sequence and the nucleotide sequences encoding polypeptides of Chlamydia trachomatis, 
such as cellular envelope polypeptides, which are secreted or specific, or which are involved in metabolism, in the replication process or 
in virulence, polypeptides encoded by such sequences, as well as vectors including the said sequences and cells or animals transformed 
with these vectors. The invention also relates to transcriptional gene products of the Chlamydia trachomatis genome, such as, for example, 
antisense and ribozyme molecules, which can be used to control growth of the microorganism. Tlie invention also relates to methods 
of detecting these nucleic acids or polypeptides and kits for diagnosing Chlamydia trachomatis infection. The invention also relates to a 
method of selecting compounds capable of modulating bacterial infection and a method for the biosynthesis or biodegradation of molecules 
of interest using the said nucleotide sequences or the said polypeptides. The invention finally comprises, pharmaceutical, in particular 
vaccine, compositions for the prevention and/or treatment of bacterial, in particular Chlamydia trachomatis, infections. 



♦(Referred to in PCT Gazette No. 34/1999, Section II) 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


Ft 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6tc d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SB 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







INTERNATIONAL SEARCH REPORT 



Inter onal Application No 

PCT/IB 98/01939 



t P rr^HT57Sr"ciai5/62 C07K14/295 C07K16/12 
A01K67/027 A61K39/118 G01N33/53 C12Q1/68 

According to International Patent Classification (IPC) or to both national class ifleation and IPC 



C07K19/00 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K C12N 



Documentation Marched other than minimum documentation to the extent that euch documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



KAUL R ET AL: "Cloning, expression, and 

primary structure of a Chlamydia 

trachomatis binding protein." 

J BACTERIOL, NOV 1987, 169 (11) P5152-6, 

XP002076806 

UNITED STATES 

page 5153, right-hand column, paragraph 2 
abstract 

page 5155, left-hand column, paragraph 4 
-page 5156, left-hand column, paragraph 1 

TOMB J F ET AL: "THE COMPLETE GENOME 
SEQUENCE OF THE GASTRIC PATHOGEN 
HELICOBACTER PYLORI " 
NATURE, 

vol. 388, 7 August 1997 (1997-08-07), 
pages 539-547, XP002062106 
the whole document 



-/• 



1-3,7,9, 

11,13, 

26,27, 

30,44, 

45,48, 

51-54 



1-3,7,9, 

11,13, 

26,27, 

30,44, 

45,48, 

51-54 



Further documents are listed in the continuation of box C. 



m 



Patent family members ore listed in annex. 



8 Special categories of cited documents : 

*A' document defining the general state of the art which is not 

considered to be of particular relevance 
*E" eariier dooument but published on or after the international 

filing date 

V document which may throw doubts on priority claim (s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

•O* document referring to an oral disclosure, use, exhibition or 
other means 

•P* document published prior to the international filing date but 
later than the priority date claimed 



"T* later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

*X' document of particular relevance ; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is token alone 

*Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

'&* document member of the same patent family 



Date of the actual completion of the international search 



3 June 1999 



Date of marling of the international search report 



24. 09 99 



Name and mailing address of the ISA 

European Patent Office, P. 8. 5818 Patentlaan 2 
NL • 2280 HV R^swipt 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 



Authorized officer 



Gurdjian, D 



Form PCT/JSA/210 (socond shcot) (Juty 1992) 



page 1 of 3 



INTERNATIONAL SEARCH REPORT 



Intf Uonal Application No 

PCT/IB 98/01939 



C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category • 


Citation of document, with indication, who re appropriate, of the relevant passages 


Relevant to claim No. 


A 


CLARKE IN ET AL: "Molecular cloning and 
sequence analysis of a developmental ly 
regulated cysteine-rich outer membrane 
protein from Chlamydia trachomatis." 
GENE, NOV 30 1988, 71 (2) P307-14, 
XP002076807 
NETHERLANDS 
abstract 


1 


A 


GU L ET AL: "Cloning and characterization 
of a secY homolog from Chlamydia 
trachomatis." 

MOL GEN GENET, MAY 25 1994, 243 (4) 

P482-7, XP002076808 

GERMANY 

abstract 

page 486; figures 2,3 


1 


A 


KAUL R ET AL: "The chlamydial EUO gene 

encodes a hi stone Hl-specific protease." 

J BACTERIOL, SEP 1997, 179 (18) P5928-34, 

XP002O768O9 

UNITED STATES 

abstract 


1 


A 


PEDERSEN LB ET AL: "Purification of 

recombinant Chlamydia trachomatis hi stone 

Hl-like protein Hc2, and comparative 

functional analysis of Hc2 and Hcl." 

MOL MICROBIOL, APR 1996, 20 (2) P295-311, 

XP00207681O 

ENGLAND 

abstract 

page 295, right-hand column, paragraph 2 
-page 307, left-hand column, paragraph 2 


1 


A 


ZHANG Y ET AL: "Elongation factor Ts of 
Chlamydia trachomatis: structure of the 
gene and properties of the protein." 
ARCH BIOCHEM BIOPHYS, AUG 1 1997, 344 (1) 
P43-52, XP002076811 
UNITED STATES 
abstract 

page 43, right-hand column, paragraph 1 


1 


A 


EP 0 293 079 A (ALBERTA LTD 368800) 
30 November 1988 (1988-11-30) 
claims 1-19 

/ 


1 



Fcim PCT/ISA/210 (ecntmuauon ct sccsna shoot) (July 1992) 



page 2 of 3 



2 



INTERNATIONAL SEARCH REPORT into' onal Application No 

PCT/IB 98/01939 


C(Contlnualion) DOCUMENTS CONSIDERED TO BE RELEVANT 




Category' 


Citation of dooument, with indication.where appropriate, of the relevant pas a ages 


Relevant to olaim No. 


P.X 


STEPHENS RS ET AL: "Genome sequence of an * 

obligate intracellular pathogen of humans: 

Chlamydia trachomatis [see comnents] " 

SCIENCE, OCT 23 1998, 282 (5389) P754-9, 

XPG02104802 

UNITED STATES 

the whole document 


1-3,7,9, 
11,13 



Farm PCT/ISA/210 (continuation ol saoond shoat) (July 1992) 



page 3 of 3 



INTERNATIONAL SEARCH REPORT 



-national application No. 

PCT/IB 98/01939 



Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claims 40-43 

are directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged 
effects of the compound/composition. 

Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box tl Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

see ADDITIONAL SHEET 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
I — ' covers only those claims for which fees were paid, specifically claims Nos.: 



No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-3 and 7,9,11,13,26,27,30,44,45,48 (partially) 



Remark on Protest 




The additional search fees were accompanied by the applicant's protest. 



| | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/21 0 (continuation of first sheet (1)) (July 1998) 



INTERNATIONAL SEARCH REPORT 



International Application No. PCT/ IB 98/01939 

FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Invention 1 : claims 1-3 and 7,9,11,13,26,27,30,44,45,48 (partially) 

nucleotide seq.id.n.l coding for the genome 

of Chlamydia trachomatis .corresponding 

vector , host method of detection , DNA chip , screening 

assay and kit . 

Invention 2 : claims 4-56 (partially ) 

0RF2 of Chlamydia trachomatis fragments , 
corresponding polypeptides, nucleotide 
sequences , DNA chip, cloning vector , host, method 
for producing polypeptides .fusion polypeptide , 
method for the detection , kit , antibody , 
iirmunogenic and pharmaceutical composition , 
screening assay . 

Inventions 3-1197 : identical to invention 2 , but applied to 

orf3-1197 , in which invention 3 is limited to 0RF3 , 

invention 4 to 0RF4 , etc. ... until invention 1197 is limited 
to 0RF1197 . 



INTERNATIONAL SEARCH REPORT 



rmatlon on patent family members 



Interr nai Application No 

PCT/IB 98/01939 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



LP 0293G79 



30-11-1988 



ZA 8802834 A 



27-10-1988 



Perm PC TVtSA/2 10 (paasm family annax) (July 1992) 



